a 
as : 
at cs 
Caachi 
4; a4 
iain 
sen anieetnin eas 
ee C iy 
i 
oh st 
Facts 
seas at 
oxtes a 
uhh 
sett Sah ace 


Pe 
i? 
ce 


rm 
53 


Ste 
te 
cnt, 


eres 
ote 

eaeh 
see 


e 
5 


a3 
ie 


St 


re 
aeaee' 

eeeeetss 

neste! 


tue! 


COFFEES, 
pests Min erect 
Pergenetcelece 

errurkeces 

ror, 


ete 
re! 


ie 
< 


re 
©: 


st 


e 


Z 
2 


et 


2 
eeeccnrer, 
reteset 


< 


ce 
te 
rae 


es 
f 


exo 
ecett 


| 
¢ ee 


a 


s 
© 
bs 


et 

‘i 
e! 
7 


> 
5 


Bis 


*) 


\ 


3 
e 


e 


Ty 


ei 


+5 


cote . 
cocnceceteee 
: 
2 
. 
3 


u 
xt 


5 


siete 
eerste 
* 
aS 


ra 

a2 

ior 
33 


ee 


a 


, 
S 


a4 


Se 


st 


3 
re 


35 


3 


oe 


Pye eC 
5 


a 
tet 


eS 


rie 
ete 


ss 


o 
3 
cs 


up: 
ie 


cs 
x 


Eee 


i? 
+! 


npect ite 


2% 


ptorotn tat, 


ce 
th 

Tet retet 
2 ps5 


ee, 


53 


aoe 
tec 
Stetesetes 


sceretete 
Tease soho tare 


= 
ss 


oe, 


Te, 


erate 


cr, 


it 


335 


* 
teres 


Sale 


ar 


= 


Ao 


Sots 


Sethe 
rete 

ees 

a ne 


Sete 
3 


Bete 
oa 


2S 


Se! 


ae 


S 
mt 


tel 
3 


See 


Sons 


ao 


eas 


25 


a 


oe 


eseces 


a5 


Sos 


Sent 


it 


“i 


a 


i 
nie 


Sh 


He 


rif 


( 
ae 
pa 


Fup 


" 


at 


“ 


EX LIBRIS 


on 
ln 
an 
as 
oy 
Z 
ae 
eo) 
re 
i 
n 


ea 
S) 
5) 
4 
— 
je) 
eo) 


Digitized by the Internet Archive 
in 2022 with funding from 
Kahle/Austin Foundation 


al 


httos://archive.org/details/largescalestructOO0O0hawk 


CAMBRIDGE MONOGRAPHS 
ON MATHEMATICAL PHYSICS 
General Editors: W.H.McCrea, D.W.Sciama, J.C. Polkinghorne 


THE LARGE SCALE STRUCTURE 
OF SPACE-TIME 


(t 


THE LARGE SCALE STRUCTURE 
OF SPAGE=PIME 


S.W. HAWKING, F.RS. _ 
Institute of Astronomy and Gonville and Caius College, Cambridge 
AND 
G. F.R. ELLIS 


Department of Applied Mathematics, University of Capetown 


CAMBRIDGE UNIVERSITY PRESS 


Published by the Syndies of the Cambridge University Press 
Bentley House, 200 Euston Road, London NW1 2DB 
American Branch: 32 East 57th Street, New York, N.Y. 10022 


© Cambridge University Press 1973 
Library of Congress Catalogue Card Number: 72—93671 


ISBNs: 
0 521 20016 4 hard covers 
0 521 09906 4 paperback 


First published 1973 
First paperback edition 1974 


Printed in Great Britain 
at the University Printing House, Cambridge 
(Brooke Crutchley, University Printer) 


To 
D.W.SCIAMA 


Contents 


Preface 


1 


2 

Ze 
2.2 
2.3 
2.4 
2.5 
2.6 
yas) 
2.8 
20 


3 

3.1 
3.2 
3.3 
3.4 


4 

4.1 
4.2 
4.3 
4.4 
4.5 


The role of gravity 


Differential geometry 

Manifolds 

Vectors and tensors 

Maps of manifolds 

Exterior differentiation and the Lie derivative 
Covariant differentiation and the curvature tensor 
The metric 

Hypersurfaces 

The volume element and Gauss’ theorem 

Fibre bundles 


General Relativity 

The space-time manifold 
The matter fields 
Lagrangian formulation 
The field equations 


The physical significance of curvature 
Timelike curves 

Null curves 

Energy conditions 

Conjugate points 

Variation of arc-length 


5 Exact solutions 


5.1 
5.2 
5.3 
5.4 


Minkowski space-time 

De Sitter and anti-de Sitter spacetimes 
Robertson—Walker spaces 

Spatially homogeneous cosmological models 


{ vii ] 


page xi 
1 


10 
ul 
15 
22 
24 
30 
36 
44 
47 
50 


56 
56 
59 
64 
71 


78 
78 
86 
88 
96 
102 


117 
118 
124 
134 
142 


CONTENTS 


The Schwarzschild and Reissner—Nordstr6m 
solutions 

The Kerr solution 

Gédel’s universe 

Taub-NUT space 

Further exact solutions 


Causal structure 

Orientability 

Causal curves 

Achronal boundaries 

Causality conditions 

Cauchy developments 

Global hyperbolicity 

The existence of geodesics 

The causal boundary of space-time 
Asymptotically simple spaces 


The Cauchy problem in General Relativity 


The nature of the problem 

The reduced Einstein equations 
The initial data 

Second order hyperbolic equations 


page 149 
161 
168 
170 
178 


180 
181 
182 
186 
189 
201 
206 
213 
217 
221 


226 
227 
228 
231 
233 


~ 


The existence and uniqueness of developments for 


the empty space Einstein equations 
The maximal development and stability 
The Einstein equations with matter 


Space-time singularities 

The definition of singularities 
Singularity theorems 

The description of singularities 
The character of the singularities 
Imprisoned incompleteness 


Gravitational collapse and black holes 
Stellar collapse 

Black holes 

The final state of black holes 


244 
249 
254 


256 
256 
261 
276 
284 
289 


299 
299 
308 
323 


CONTENTS 


10 The initial singularity in the universe page 348 
10.1 The expansion of the universe 348 
10.2 The nature and implications of singularities 359 

Appendix A: 

Translation of an essay by P. 8. Laplace 365 

Appendix B: 

Spherically symmetric solutions and Birkhoff’s 

theorem 369 
References 373 
Notation 381 


Index 385 


Preface 


The subject of this book is the structure of space-time on length- 
scales from 10-%cm, the radius of an elementary particle, up to 
10cm, the radius of the universe. For reasons explained in 
chapters 1 and 3, we base our treatment on Einstein’s General 
Theory of Relativity. This theory leads to two remarkable pre- 
dictions about the universe: first, that the final fate of massive 
stars is to collapse behind an event horizon to form a ‘black hole’ 
which will contain a singularity; and secondly, that there is a 
singularity in our past which constitutes, in some sense, a begin- 
ning to the universe. Our discussion is principally aimed at developing 
these two results. They depend primarily on two areas of study: first, 
the theory of the behaviour of families of timelike and null curves in 
space-time, and secondly, the study of the nature of the various 
causal relations in any space-time. We consider these subjects in 
detail. In addition we dévelop the theory of the time-development 
of solutions of Einstein’s equations from given initial data. The dis- 
cussion is supplemented by an examination of global properties of 
a variety of exact solutions of Einstein’s field equations, many of 
which show some rather unexpected behaviour. 

This book is based in part on an Adams Prize Essay by one of us 
(S. W. H.). Many of the ideas presented here are due to R. Penrose 
and R. P. Geroch, and we thank them for their help. We would refer 
our readers to their review articles in the Battelle Rencontres (Penrose 
(1968)), Midwest Relativity Conference Report (Geroch (1970c)), 
Varenna Summer School Proceedings (Geroch (1971)), and Pittsburgh 
Conference Report (Penrose (19726)). We have benefited from dis- 
cussions and suggestions from many of our colleagues, particularly 
B. Carter and D. W. Sciama. Our thanks are due to them also. 


Cambridge . S. W. Hawking 
January 1973 G. F. R. Ellis 
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I} 
The role of gravity 


The view of physics that is most generally accepted at the moment is 
that one can divide the discussion of the universe into two parts. First, 
there is the question of the local laws satisfied by the various physical 
fields. These are usually expressed in the form of differential equations. 
Secondly, there is the problem of the boundary conditions for these 
equations, and the global nature of their solutions. This involves 
thinking about the edge of space-time in some sense. These two parts 
may not be independent. Indeed it has been held that the local laws 
are determined by the large scale structure of the universe. This view 
is generally connected with the name of Mach, and has more recently 
been developed by Dirac (1938), Sciama (1953), Dicke (1964), Hoyle 
and Narlikar (1964), and others. We shall adopt a less ambitious 
approach: we shall take the local physical laws that have been experi- 
mentally determined, and shall see what these laws imply about the 
large scale structure of the universe. 

There is of course a large extrapolation in the assumption that the 
physical laws one determines in the laboratory should apply at other 
points of space-time where conditions may be very different. If they 
failed to hold we should take the view that there was some other 
physical field which entered into the local physical laws but whose 
existence had not yet been detected in our experiments, because it 
varies very little over a region such as the solar system. In fact most of 
our results will be independent of the detailed nature of the physical 
laws, but will merely involve certain general properties such as the 
description of space-time by a pseudo-Riemannian geometry and the 
positive definiteness of energy density. 

The fundamental interactions at present known to physics can be 
divided into four classes: the strong and weak nuclear interactions, 
electromagnetism, and gravity. Of these, gravity is by far the weakest 
(the ratio Gm?/e? of the gravitational to electric force between two 
electrons is about 10-*°). Nevertheless it plays the dominant role in 
shaping the large scale structure of the universe. This is because the 
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strong and weak interactions have a very short range (~ 10-1 cm or 
less), and although electromagnetism is a long range interaction, the 
repulsion of like charges is very nearly balanced, for bodies of macro- 
scopic dimensions, by the attraction of opposite charges. Gravity on 
the other hand appears to be always attractive. Thus the gravitational 
fields of all the particles in a body add up to produce a field which, for 
sufficiently large bodies, dominates over all other forces. 

Not only is gravity the dominant force on a large scale, but it is a 
force which affects every particle in the same way. This universality 
was first recognized by Galileo, who found that any two bodies fell 
with the same velocity. This has been verified to very high precision 
in more recent experiments by Eotvos, and by Dicke and his collabo- 
rators (Dicke (1964)). It has also been observed that light is deflected 
by gravitational fields. Since it is thought that no signals can travel 
faster than light, this means that gravity determines the causal 
structure of the universe, i.e. it determines which events of space-time 
can be causally related to each other. 

These properties of gravity lead to severe problems, for if a suffi- 
ciently large amount of matter were concentrated in some region, it 
could deflect light going out from the region so much that it was in fact 
dragged back inwards. This was recognized in 1798 by Laplace, who 
pointed out that a body of about the same density as the sun but 
250 times its radius would exert such a strong gravitational field that 
no light could escape from its surface. That this should have been 
predicted so early is so striking that we give a translation of Laplace's 
essay in an appendix. 

One can express the dragging back of light by a massive body more 
precisely using Penrose’s idea of a closed trapped surface. Consider 
a sphere 7 surrounding the body. At some instant let 7 emit a flash 
of light. At some later time t, the ingoing and outgoing wave fronts 
from J will form spheres 7, and 7, respectively. In a normal situa- 
tion, the area of 7, will be less than that of 7 (because it represents 
ingoing light) and the area of Z, will be greater than that of Z 
(because it represents outgoing light; see figure 1). However if a suffi- 
ciently large amount of matter is enclosed within .7, the areas of a 
and J, will both be less than that of 7. The surface.T is then said to 
be a closed trapped surface. As ¢ increases, the area of 7, 2 Will get 
smaller and smaller provided that gravity remains attractive, ie. pro- 
vided that the energy density of the matter does not become negative. 
Since the matter inside 7 cannot travel faster than light, it will be 
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trapped within a region whose boundary decreases to zero within a 
finite time. This suggests that something goes badly wrong. We shall 
in fact show that in such a situation a space-time singularity must 
occur, if certain reasonable conditions hold. 

One can think of a singularity as a place where our present laws of 
physics break down. Alternatively, one can think of it as representing 
part of the edge of space-time, but a part which is at a finite distance 
instead of at infinity. On this view, singularities are not so bad, but one 
still has the problem of the boundary conditions. In other words, one 
does not know what will come out of the singularity. 


Figure 1. At some instant, the sphere 7 emits a flash of light. At a later time, 
the light from a point p forms a sphere Y around p, and the envelopes 7, and 
J , form the ingoing and outgoing wavefronts respectively. If the areas of both 
J ,and J, are less than the area of 7, then 7 is a closed trapped surface. 


There are two situations in which we expect there to be a sufficient 
concentration of matter to cause a closed trapped surface. The first is 
in the gravitational collapse of stars of more than twice the mass of 
the sun, which is predicted to occur when they have exhausted their 
nuclear fuel. In this situation, we expect the star to collapse to a singu- 
larity which is not visible to outside observers. The second situation is 
that of the whole universe itself. Recent observations of the microwave 
background indicate that the universe contains enough matter to 
cause a time-reversed closed trapped surface. This implies the exist- 
ence of a singularity in the past, at the beginning of the present epoch 
of expansion of the universe. This singularity is in principle visible to 
us. It might be interpreted as the beginning of the universe. 


4 THE ROLE OF GRAVITY 


In this book we shall study the large scale structure of space-time 
on the basis of Einstein’s General Theory of Relativity. The predic- 
tions of this theory are in agreement with all the experiments so far 
performed. However our treatment will be sufficiently general to cover 
modifications of Einstein’s theory such as the Brans—Dicke theory. 

While we expect that most of our readers will have some acquain- 
tance with General Relativity, we have endeavoured to write this 
book so that it is self-contained apart from requiring a knowledge of 
simple calculus, algebra and point set topology. We have therefore 
devoted chapter 2 to differential geometry. Our treatment is reason- 
ably modern in that we have formulated our definitions in a manifestly 
coordinate independent manner. However for computational con- 
venience we do use indices at times, and we have for the most part 
avoided the use of fibre bundles. The reader with some knowledge of 
differential geometry may wish to skip this chapter. 

In chapter 3 a formulation of the General Theory of Relativity is 
given in terms of three postulates about a mathematical model for 
space-time. This model is a manifold W with a metric § of Lorentz 
signature. The physical significance of the metric is given by the first 
two postulates: those of local causality and of local conservation of 
energy-momentum. These postulates are common to both the General 
and the Special Theories of Relativity, and so are supportéd by the 
experimental evidence for the latter theory. The third postulate, the 
field equations for the metric §, is less well experimentally established. 
However most of our results will depend only on the property of the 
field equations that gravity is attractive for positive matter densities. 
This property is common to General Relativity and some modifications 
such as the Brans—Dicke theory. 

In chapter 4, we discuss the significance of curvature - considering 
its effects on families of timelike and null geodesics. These represent 
the paths of small particles and of light rays respectively. The curva- 
ture can be interpreted as a differential or tidal force which induces 
relative accelerations between neighbouring geodesics. If the energy— 
momentum tensor satisfies certain positive definite conditions, this 
differential force always has a net converging effect on non-rotating 
families of geodesics. One can show by use of Raychaudhuri’s equation 
(4.26) that this then leads to focal or conjugate points where neigh- 
bouring geodesics intersect. 

To see the significance of these Foie points, consider a one-dimen- 
sional surface in two-dimensional Euclidean space (figure 2). Let p 
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be a point not on /. Then there will be some curve from ¥ to p which 
is shorter than, or as short as, any other curve from F to p. Clearly 
this curve will be a geodesic, i.e. a straight line, and will intersect £ 
orthogonally. In the situation shown in figure 2, there are in fact three 
geodesics orthogonal to. which pass through p. The geodesic through 
the point r is clearly not the shortest curve from ¥ to p. One way of 
recognizing this (Milnor (1963)) is to notice that the neighbouring 


FIcuRE 2. The line pr cannot be the shortest line from p to Y, because there is 
a focal point g between p and +. In fact either px or py will be the shortest line 
from p to F. % 


geodesics orthogonal to through u and vw intersect the geodesic 
through r at a focal point q between ¥ and p. Then joining the segment 
uq to the segment gp, one could obtain a curve from ¥ to p which had 
the same length as a straight line rp. However as ugp is not a straight 
line, one could round off the corner at g to obtain a curve from F to p 
which was shorter than rp. This shows that rp is not the shortest curve 
from ¥ to p. In fact the shortest curve will be either xp or yp. 

One can carry these ideas over to the four-dimensional space-time 
manifold @ with the Lorentz metric g. Instead of straight lines, one 
considers geodesics, and instead of considering the shortest curve one 
considers the longest timelike curve between a point p and a spacelike 
surface Y (because of the Lorentz signature of the metric, there will 
be no shortest timelike curve but there may be a longest such curve). 
This longest curve must be a geodesic which intersects.Y orthogonally, 
and there can be no focal point of geodesics orthogonal to. Y between 
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SF and p. Similar results can be proved for null geodesics. These results 
are used in chapter 8 to establish the existence of singularities under 
certain conditions. 

In chapter 5 we describe a number of exact solutions of Einstein’s 
equations. These solutions are not realistic in that they all possess 
exact symmetries. However they provide useful examples for the suc- 
ceeding chapters and illustrate various possible behaviours. In 
particular, the highly symmetrical cosmological models nearly all 
possess space-time singularities. For a long time it was thought that 
these singularities might be simply a result of the high degree of 
symmetry, and would not be present in more realistic models. It will 
be one of our main objects to show that this is not the case. 

In chapter 6 we study the causal structure of space-time. In Special 
Relativity, the events that a given event can be causally affected by, 
or can causally affect, are the interiors of the past and future light 
cones respectively (see figure 3). However in General Relativity the 
metric g which determines the light cones will in general vary from 
point to point, and the topology of the space-time manifold W need 
not be that of Euclidean space #4. This allows many more possibilities. 
For instance one can identify corresponding points on the surfaces 
fF, and F, in figure 3, to produce a space-time with topology FA? x S!. 
This would contain closed timelike curves. The existence*of such a 
curve would lead to causality breakdowns in that one could travel into 
one’s past. We shall mostly consider only space—times which do not 
permit such causality violations. In such a space-time, given any 
spacelike surface /, there is a maximal region of space-time (called 
the Cauchy development of) which can be predicted from knowledge 
of data on #. A Cauchy development has a property (‘Global hyper- 
bolicity’) which implies that if two points in it can be joined by a time- 
like curve, then there exists a longest such curve between the points. 
This curve will be a geodesic. 

The causal structure of space-time can be used to define a boundary 
or edge to space-time. This boundary represents both infinity and the 
part of the edge of space-time which is at a finite distance, i.e. the 
singular points. 

In chapter 7 we discuss the Cauchy problem for General Relativity. 
We show that initial data on a spacelike surface determines a unique 
solution on the Cauchy development of the surface, and that in a 
certain sense this solution depends continuously on the initial data. 
This chapter is included for completeness and because it uses a number 
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Future light cone 


Past light cone 


Ficure 3. In Special Relativity, the light cone of an event p is the set of all 
light rays through p. The past of = is the interior of the past hght cone, and the 
future of p is the interior of the future light cone. 


of results of the previous chapter. However it is not necessary to read 
it in order to understand the following chapters. 

In chapter 8 we discuss the definition of space-time singularities. 
This presents certain difficulties because one cannot regard the singular 
points as being part of the space-time manifold .#. 

We then prove four theorems which establish the occurrence of 
space-time singularities under certain conditions. These conditions 
fall into three categories. First, there is the requirement that gravity 
shall be attractive. This can be expressed as an inequality on the 
energy-momentum tensor. Secondly, there is the requirement that 
there is enough matter present in some region to prevent anything 
escaping from that region. This will occur if there is a closed trapped 
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surface, or if the whole universe is itself spatially closed. The third 
requirement is that there should be no causality violations. However 
this requirement is not necessary in one of the theorems. The basic 
idea of the proofs is to use the results of chapter 6 to prove there must 
be longest timelike curves between certain pairs of points. One then 
shows that if there were no singularities, there would be focal points 
which would imply that there were no longest curves between the pairs 
of points. 

We next describe a procedure suggested by Schmidt for constructing 
a boundary to space-time which represents the singular points of 
space-time. This boundary may be different from that part of the 
causal boundary (defined in chapter 6) which represents singularities. 

In chapter 9, we show that the second condition of theorem 2 of 
chapter 8 should be satisfied near stars of more than 14 times the solar 
mass in the final stages of their evolution. The singularities which occur 
are probably hidden behind an event horizon, and so are not visible 
from outside. To an external observer, there appears to be a ‘black 
hole’ where the star once was. We discuss the properties of such black 
holes, and show that they probably settle down finally to one of the 
Kerr family of solutions. Assuming this to be the case, one can place 
certain upper bounds on the amount of energy which can be extracted 
from black holes. In chapter 10 we show that the second conditions of 
theorems 2 and 3 of chapter 8 should be satisfied, in a time-reversed 
sense, in the whole universe. In this case, the singularities are in our 
past and constitute a beginning for all or part of the observed universe. 

The essential part of the introductory material is that in § 3.1, §3.2 
and §3.4. A reader wishing to understand the theorems predicting the 
existence of singularities in the universe need read further only chap- 
ter 4, §6.2-$6.7, and § 8.1 and § 8.2. The application of these theorems 
to collapsing stars follows in §9.1 (which uses the results of appen- 
dix B); the application tothe universe as a whole is given in § 10.1, and 
relies on an understanding of the Robertson—Walker universe models 
($5.3). Our discussion of the nature of the singularities is contained 
in § 8.1, §8.3-§8.5, and § 10.2; the example of Taub-NUT space (§ 5.8) 
plays an important part in this discussion, and the Bianchi I universe 
model (§ 5.4) is also of some interest. 

A reader wishing to follow our discussion of black holes need read 
only chapter 4, §6.2-§ 6.6, § 6.9, and § 9.1, §9.2 and § 9.3. This discus- 
sion relies on an understanding of the Schwarzschild solution ($5.5) 
and of the Kerr solution (§ 5.6). 


THE ROLE OF GRAVITY 9 


Finally a reader whose main interest is in the time evolution 
properties of Einstein’s equations need read only §6.2-§6.6 and 
chapter 7. He will find interesting examples given in §5.1, §5.2 and 
§ 5.5. 

We have endeavoured to make the index a useful guide to all the 
definitions introduced, and the relations between them. 


2 
Differential geometry 


The space-time structure discussed in the next chapter, and assumed 
through the rest of this book, is that of a manifold with a Lorentz 
metric and associated affine connection. 

In this chapter, we introduce in § 2.1 the concept of a manifold and 
in § 2.2 vectors and tensors, which are the natural geometric objects 
defined on the manifold. A discussion of maps of manifolds in §2.3 
leads to the definitions of the induced maps of tensors, and of sub- 
manifolds. The derivative of the induced maps defined by a vector 
field gives the Lie derivative defined in §2.4; another differential 
operation which depends only on the manifold structure is exterior 
differentiation, also defined in that section. This operation occurs in 
the generalized form of Stokes’ theorem. 

An extra structure, the connection, is introduced in §2.5; this 
defines the covariant derivative and the curvature tensor. The connec- 
tion is related to the metric on the manifold in § 2.6; the curvature 
tensor is decomposed into the Wey] tensor and Ricci tensor, which are 
related to each other by the Bianchi identities. 

In the rest of the chapter, a number of other topics in differential 
geometry are discussed. The induced metric and connection on a 
hypersurface are discussed in § 2.7, and the Gauss—Codacci relations 
are derived. The volume element defined by the metric is introduced 
in §2.8, and used to prove Gauss’ theorem. Finally, we give a brief 
discussion in § 2.9 of fibre bundles, with particular emphasis on the 
tangent bundle and the bundles of linear and orthonormal frames. 
These enable many of the concepts introduced earlier to be reformu- 
lated in an elegant geometrical way. §2.7 and § 2.9 are used only at 
one or two points later, and are not essential to the main body of the 
book. 


[ 10] 


Ze1] MANIFOLDS tg 


2.1 Manifolds 


A manifold is essentially a space which is locally similar to Euclidean 
space in that it can be covered by coordinate patches. This structure 
permits differentiation to be defined, but does not distinguish intrin- 
sically between different coordinate systems. Thus the only concepts 
defined by the manifold structure are those which are independent of 
the choice of a coordinate system. We will give a precise formulation 
of the concept of a manifold, after some preliminary definitions. 

Let R” denote the Euclidean space of n dimensions, that is, the set 
of all n-tuples (x1, x?, ...,2") (—00 < xt < o) with the usual topology 
(open and closed sets are defined in the usual way), and let 4.” denote 
the ‘lower half’ of R”, i.e. the region of R” for which z! < 0. A map ¢ of 
an open set (0 < R” (respectively 4”) to an open set 0’ < R™ (respec- 
tively 1.2”) is said to be of class C’ if the coordinates (x'1, x, ..., 2’™) of 
the image point ¢(p) in ©’ are r-times continuously differentiable 
functions (the rth derivatives exist and are continuous) of the co- 
ordinates (x}, x?,...,x”) of p in ©. If a map is C’ for all r > 0, then it is 
said to be C~. By a C° map, we mean a continuous map. 

A function f on an open set © of R” is said to be locally Lipschitz if 
for each open set Y < © with compact closure, there is some constant 
K such that for each pair of points p,qeY, |f(p)—f(q)| < K |p—4q|, 


where by |p| wemean 


{(a(p))? + (2(p))? +... + (a(p))?}. 
A map ¢ will be said to be locally Lipschitz, denoted by C1, if the 
coordinates of 4(p) are locally Lipschitz functions of the coordinates 
of p. Similarly, we shall say that a map ¢ is C’~ if it is C’ and if the 
(r—1)th derivatives of the coordinates of ¢(p) are locally Lipschitz 
functions of the coordinates of p. In the following we shall usually only 
mention C’, but similar definitions and results hold for C’-. 

If Fis an arbitrary set in R” (respectively 42”), a map ¢ from F to 
a set P’ < R™ (respectively 4R”) is said to be a C” map if ¢ is the 
restriction to AZ and # of a C* map from an open set @ containing 7 
to an open set 0’ containing F’. 

A Cr n-dimensional manifold M is a set M together with a C* atlas 
{M,,6,}, that is to say a collection of charts (%,, o,) where the %, are 
subsets of W and the ¢, are one-one maps of the corresponding %, to 
open sets in R” such that 

(1) the %, cover -M, i.e. M= UY,, 

= 
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(2) if Yn U, is non-empty, then the map 
Da °) Py: Pi(U, q Us) 6 PU, a) U;) 


is a C* map of an open subset of R” to an open subset of R” (see figure 4). 

Each %, isa local coordinate neighbourhood with the local coordinates 
x* (a = 1ton) defined by the map ¢, (i.e. ifp¢%,, then the coordinates 
of p are the coordinates of ¢,(p) in R”). Condition (2) is the requirement 
that in the overlap of two local coordinate neighbourhoods, the 
coordinates in one neighbourhood are C” functions of the coordinates 
in the other neighbourhood, and vice versa. 


Re $a(Va 0 &p) 


Se, iy 4 — bal Ua 0 sp) 


Figure 4. In the overlap of coordinate neighbourhoods Y, and Y,, coordinates 
are related by a C’ map ¢,0¢,71. 


Another atlas is said to be compatible with a given C” atlas if their 
union is a C" atlas for all .@. The atlas consisting of all atlases com- 
patible with the given atlas is called the complete atlas of the manifold; 
the complete atlas is therefore the set of all possible coordinate 
systems covering -#. 

The topology of -@ is defined by stating that the open sets of W 
consist of unions of sets of the form Y, belonging to the complete atlas. 
This topology makes each map ¢, into a homeomorphism. 

A C’ differentiable manifold with boundary is defined as above, on 
replacing ‘R”’ by ‘$R”’. Then the boundary of W, denoted by 0%, is 
defined to be the set of all points of M whose image under a map ¢, lies 
on the boundary of $f” in R”. é. is an (n — 1)-dimensional C* manifold 
without boundary. 
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These definitions may seem more complicated than necessary. How- 
ever simple examples show that one will in general need more than one 
coordinate neighbourhood to describe a space. The two-dimensional 
Huclidean plane R? is clearly a manifold. Rectangular coordinates 
(t, y; —0 <x< 00, —0 <y <0) cover the whole plane in one 
coordinate neighbourhood, where ¢ is the identity. Polar coordinates 
(r,@) cover the coordinate neighbourhood (r > 0, 0 < 6 < 27); one 
needs at least two such coordinate neighbourhoods to cover R?. The 
two-dimensional cylinder C? is the manifold obtained from R? by identi- 
fying the points (x,y) and (x+27,y). Then (z, y) are coordinates in 
a neighbourhood (0 < x < 27, ~co <y<«) and one needs two 
such coordinate neighbourhoods to cover C?. The Mébius strip is the 
manifold obtained in a similar way on identifying the points (a, y) and 
(z+ 27, —y). The unit two-sphere S? can be characterized as the surface 
in #? defined by the equation (x1)? + (x?)? + (23)? = 1. Then 


(Gare — 1 <2 < 1, — 1 < 2 < 1) 


are coordinates in each of the regions z! > 0, z! < 0, and one needs six 

such coordinate neighbourhoods to cover the surface. In fact, it is not 

possible to cover S? by a single coordinate neighbourhood. The 

n-sphere S” can be similarly defined as the set of points 
aye... ie) = 1 

nol eae o 

A manifold is said td be orientable if there is an atlas {Y%,, ¢,} in the 
complete atlas such that in every non-empty intersection Y, 1 U%,, the 
Jacobian |éx*/dx'3| is positive, where (z},...,2”) and (2’1,...,”’") are 
coordinates in Y, and %, respectively. The Mobius strip is an example 
of a non-orientable manifold. 

The definition of a manifold given so far is very general. For most 
purposes one will impose two further conditions, that 4 is Hausdorff 
and that .@ is paracompact, which will ensure reasonable local 
behaviour. 

A topological space -@ is said to be a Hausdorff space if it satisfies 
the Hausdorff separation axiom: whenever p, g are two distinct points 
in -%, there exist disjoint open sets Y¥,¥ in -@ such that peY, qev. 
One might think that a manifold is necessarily Hausdorff, but this is 
not so. Consider, for example, the situation in figure 5. We identify the 
points b, b’ on the two lines if and only if x, = y, < 0. Then each point 
is contained in a (coordinate) neighbourhood homeomorphic to an 
open subset of #1. However there are no disjoint open neighbourhoods 
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(y = Yo’) (y = 0) 


Ficure 5. An example of a non-Hausdorff manifold. The two lines above are 
identical for x = y < 0. However the two points a (x = 0) and a’(y = 0) are 
not identified. 


U,V satisfying the conditions aeY, a’ e¥ , where a is the point x = 0 
and a’ is the point y = 0. 

An atlas {Y,, ¢,} is said to be locally finite if every point pe 4 has 
an open neighbourhood which intersects only a finite number of the 
sets W,. M is said to be paracompact if for every atlas {W,,¢,} there 
exists a locally finite atlas {V;, y,} with each Y, contained in some %,. 
A connected Hausdorff manifold is paracompact if and only if it has 
a countable basis, i.e. there is a countable collection of open sets such 
that any open set can be expressed as the union of members of this 
collection (Kobayashi and Nomizu (1963), p. 271). 

Unless otherwise stated, all manifolds considered will be paracompact, 
connected C@ Hausdorff manifolds without boundary. It will turn out 
later that when we have imposed some additional structure on .@ (the 
existence of an affine connection, see § 2.4) the requirement of para- 
compactness will be automatically satisfied because of the other 
restrictions. 

A function f on a C¥ manifold -# is a map from .# to R!. It is said to 
be of class C’ (r < k) ata point p of .@, if the expression fo d,—! of f on 
any local coordinate neighbourhood YW, is a Cr function of the local 
coordinates at p; and f is said to be a C’ function ona set ¥° of if 
jis a C* function at each point peY . 

A property of paracompact manifolds we will use later, is the fol- 
lowing: given any locally finite atlas {Y,,¢,} on a paracompact O* 
manifold, one can always (see e.g. Kobayashi and Nomizu (1963), 
p. 272) find a set of C* functions g, such that 

(1) O<g, < 1 on J, for each a; 

(2) the support of g,, i.e. the closure of the set {pe.W: 9,(p) + 0}, is 
contained in the corresponding Z, ; 

(3) 9.{p) = 1, for all pew. 
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Such a set of functions will be called a partition of unity. The result 
is in particular true for C® functions, but is clearly not true for analytic 
functions (an analytic function can be expressed as a convergent 
power series in some neighbourhood of each point p € .4, and go is zero 
everywhere if it is zero on any open neighbourhood). 

Finally, the Cartesian product x @ of manifolds x, & is a mani- 
fold with a natural structure defined by the manifold structures of 
3, B: for arbitrary points pe.W, qe&, there exist coordinate neigh- 
bourhoods %, ¥ containing p, respectively, so the point (p,q)e wv xB 
is contained in the coordinate neighbourhood Y x ¥_ in . x B which 
assigns to it the coordinates (x*, y’), where x* are the coordinates of p 
in Y and y/ are the coordinates of g in ¥. 


2.2. Vectors and tensors 


Tensor fields are the set of geometric objects on a manifold defined in 
a natural way by the manifold structure. A tensor field is equivalent 
to a tensor defined at each point of the manifold, so we first define 
tensors at a point of the manifold, starting from the basic concept of 
a vector at a point. 

A C* curve A(t) in Mis a C* map of an interval of the real line R! into 
AM. The vector (contravariant vector) (4/ét),|,, tangent to the C1! curve 
A(t) at the point A(f,) is the operator which maps each C" function f at 
A(t)) into the number (@f/ét),| ,,; that is, (2f/0t), is the derivative of f in 


the direction of A(t) with respect to the parameter ¢. Explicitly, 


(Z), 


The curve parameter ¢t clearly obeys the relation (0/ét),¢ = 1. 
Tf (x1, ...,2”) are local coordinates in a neighbourhood of p, 


(2) _ 2 dxi(At)| af| _ dai af 
at), 


taty 2% Iai4,) dé da 
(Here and throughout this book, we adopt the summation convention 
whereby a repeated index implies summation over all values of that 
index.) Thus every tangent vector at a point p can be expressed as 
a linear combination of the coordinate derivatives 


(GIGS | ener (a) (aaa |e 


Conversely, given a linear combination V4(d/éz4)|,, of these operators, 
where the V’ are any numbers, consider the curve A(‘) defined by 


= lim + {f(A(t +8) —fA)} (2.1) 


t s—0 


to 4 =1 dt 


Ato) 
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xi(A(t)) = 2i(p)+tV!, for t in some interval [ —e, €]; the tangent vector 
to this curve at p is V3(d/éx’)|,,. Thus the tangent vectors at p form 
a vector space over R! spanned by the coordinate derivatives (0/éx’)|_,, 
where the vector space structure is defined by the relation 


(aX+BY)f = a(Xf)+ AYP) 


which is to hold for all vectors X, Y, numbers a, # and functions f. 
The vectors (@/éx/), are independent (for if they were not, there 
would exist numbers V’ such that V7(é/éx’)|,, = 0 with at least one V’ 
non-zero; applying this relation to each coordinate z* shows 


Vi oxtidn — VE = 0; 


a contradiction), so the space of all tangent vectors to -@ at p, denoted 
by 7,,(4) or simply 7, is an n-dimensional vector space. This space, 
representing the set of all directions at p, is called the tangent vector 
space to @ at p. One may think of a vector Ve7, as an arrow at p, 
pointing in the direction of a curve A(t) with tangent vector V at p, 
the ‘length’ of V being determined by the curve parameter t through 
the relation V(¢) = 1. (As V is an operator, we print it in bold type; 
its components V’, and the number V(f) obtained by V acting on a 
function f, are numbers, and so are printed in italics.) 

If {E,} (@ = 1 to n) are any set of n vectors at p which ate linearly 
independent, then any vector Ve7Z;, can be written V = VE, where 
the numbers {V%} are the components of V with respect to the basis 
{E,} of vectors at p. In particular one can choose the E, as the coordi- 
nate basis (@/@x*)|,,; then the components V? = V(2‘) = (dx*/dé)|,, are 
the derivatives of the coordinate functions x‘ in the direction V. 

A one-form (covariant vector) w at p is a real valued linear function 
on the space 7, of vectors at p. If X is a vector at p, the number into 
which w maps X will be written (w, X); then the linearity implies that 


(w,aX+ BY) = a{w, X)+ Bw, Y) 


holds for all «,#¢R! and X,Ye7,. The subspace of T,, defined by 
(w, X> = (constant) for a given one-form w, is linear. One may there- 
fore think of a one-form at p as a pair of planes in T, such that if 
<w, X) = 0 the arrow X lies in the first plane, and if iG 
touches the second plane. 

Given a basis {E,} of vectors at p, one can define a unique set of 
m one-forms {E*} by the condition: E‘ maps any vector X to the 
number X* (the ith component of X with respect to the basis {E,}). 
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Then in particular, (E7,E,) = 6%,. Defining linear combinations of 
one-forms by the rules 


(aw + py, X) = aw, X)+ fn, X) 

for any one-forms w, n and any a, fe R}, Xe T,,, one can regard {E*%} 
as a basis of one-forms since any one-form w at p can be expressed as 
w = w, E* where the numbers w, are defined by w, = (w, E;). Thus the 
set of all one forms at p forms an n-dimensional vector space at p, the 
dual space T’*,, of the tangent space 7;,. The basis {E*} of one-forms is 
the dual basis to the basis {E,} of vectors. For any we T*,, XeT, one 
can express the number (w, X) in terms of the components w,, X* of 
w, X with respect to dual bases {E%}, {E,} by the relations 


{w, X) = (ow; E', XIE, = w,X*. 
Each function f on -@ defines a one-form df at p by the rule: for 
each vector X, <df,X) = Xf. 


df is called the differential of f. If (x1, ...,x”) are local coordinates, the 
set of differentials (dx!, dz’, ...,dx”) at p form the basis of one-forms 
dual to the basis (@/éz1, 0/éx?, ..., é/ex") of vectors at p, since 


{dax*, é/éxt) = dx*/dxi = d%,. 
In terms of this basis, the differential df of an arbitrary function / is 
given by df = (éf/éx*) dat. 


If df is non-zero, the surfaces {f = constant} are (n — 1)-dimensional 
manifolds. The subspace of 7), consisting of all vectors X such that 
(df, X) = 0 consists of all vectors tangent to curves lying in the 
surface {f = constant} through p. Thus one may think of df as a 
normal to the surface {f = constant} at p. If a + 0, «df will also be 
a normal to this surface. 

From the space 7, of vectors at p and the space T*, of one-forms 
at p, we can form the Cartesian product 


eye x cay pe 
Mee Pt ce x a el hee Las 
ae ee 


r factors s factors 


i.e. the ordered set of vectors and one-forms (y},...,", Yi, ---, Ys) 
where the Ys and ys are arbitrary vectors and one-forms respectively. 

A tensor of type (7, 8) at p is a function on I]; which is linear in each 
argument. If T is a tensor of type (7, s) at p, we write the number into 
which T maps the element (y},...,”, Y1,..., Ys) of II; as 


Dn eee, Ya noe OF 
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Then the linearity implies that, for example, 
Tea}, 2:0 OSE BY, Va, --, Y,) = Cel oe 


Ee yi a Vee 
holds for all a, fe R1 and X, YeT,. 
The space of all such tensors is called the tensor product 


T"(p) = T,®...@T, @ T*,®.-.@T*y. 
—_—__—_——" eee eae ane 


r factors s factors 


In particular, 7{(p) = J, and T9(p) = T*,. 
Addition of tensors of type (r,s) is defined by the rule: (T + T’) is the 
tensor of type (r,s) at p such that for all Y;€7,, n’€¢7T*,, 


(T+7') (n}, ...,.97, Vp ---, V_) = Py}, 07, Vp --s Y,) 
If Gg eye 


Similarly, multiplication of a tensor by a scalar a R} is defined by the 
rule: (wT) is the tensor such that for all Y;€7,, yeT*,, 


(aT) (n}, ...,9", Yy---, ¥,) = @.T(m}, ....n7, Yq, ---» Y,)- 


With these rules of addition and scalar multiplication, the tensor 
product T%(p) is a vector space of dimension n7*5 over R}. 

Let X;eT, (j= 1 to r) and wicT*, (j = 1 to s). Then we shall 
denote by X, ® ... ®@ X, © w! ®... ® w* that element of T7%(p) which 
maps the element (y', ...,47, Y;,..., Y,) of II® into 


a X)) CARE X,» vies <n’; 2S) (w!, Y,)) ees (ws, .e. 


Similarly, if ReT%(p) and SeT¥(p), we shall denote by R @ S that 
element of 73}7(p) which maps the element (y}, ...,y'+?, Y,, ..., You) 


. > *st@ 
of II¢t¢ into the number 


LiGy:,---. 7%, Yo, ey —.. ete 


ee, Yee 


With the product @, the tensor spaces at p form an algebra 
over R. 


If {E,}, {E“} are dual bases of 7,,, 7'*,, respectively, then 
{EF ORt@ EP OE ® ... Es (ag 6; run from 1 to ), 


will be a basis for 7'%(p). An arbitrary tensor T €7%(p) can be expressed 
in terms of this basis as 


T = T%-%, ...5, Eg, ® ... @ Ey, Q@EX®... @ E's 
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where {7%} are the components of T with respect to the dual 
bases {E,}, {E“} and are given by 


| as — (E%, sae, E*,, E, Once} E,.). 


Relations in the tensor algebra at p can be expressed in terms of the 
components of tensors. Thus 


\a,...4 = ses Pilvexe 
Cle laa Fig Mt ea Pe i Bag nas 


(AME 80) y. =a. a 


+ Nils ett = eet ‘a. oa) 
(Ter y= et a Te aia mL Oe 


Because of its convenience, we shall usually represent tensor relations 
in this way. 

If {E,} and {E*} are another pair of dual bases for T, and T* , they 
can be represented in terms of {E,} and {E%} by 


Ey = ee E, (2.2) 
where ©,“ is an n x n non-singular matrix. Similarly 
Bebe Es (2.3) 


where ©, is another n x n non-singular matrix. Since {E,,}, {E“} are 
dual bases, 


Oo = cE E, = (0s ES, ©,% E,)> = 0,4 0, Oe = 0,7 Des 


ie. 0,7, O%, are inverse matrices, and 6%, = 0%, ®"’,. 
The components 771“, , of a tensor T with respect to the 
dual bases {E,}, {E“} are given by 


ee Te hays = T(E% pee8s E? ’, E,.; 1+) Ey ie 


g 


They are related to the components 7%--%, _,, of T with respect to 
the bases {E,}, {E7} by 


is us eae ( a’ b b 
ae = 7% Bek: ue tay @ li. ®,, ee ®,,, = (2.4) 


The contraction of a tensor T of type (7,s), with components 
T...d . , with respect to bases {E,}, {E*}, on the first contravariant 
and first covariant indices is defined to be the tensor C}(T) of type 
(r—1,s—1) whose components with respect to the same basis are 


Tab... Opa: 1.e. 


OUT) =7%-4,, E,®... @Eg@E'®... @ EY. 
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If {E,}, {E*} are another pair of dual bases, the contraction CHT) 
defined by them is 


CUT) = Tet» Ey @-.. @Ez,@E"®@... @E” 
= OF 04, THE OP... Og4 OF... OF, 
.E,@...@E,@E...@ EF 
= T%..d E,@...@E,@E’®... @ EY = Ci(T), 


so the contraction C} of a tensor is independent of the basis used in its 
definition. Similarly, one could contract T over any pair of contra- 
variant and covariant indices. (If we were to contract over two contra- 
variant or covariant indices, the resultant tensor would depend on the 
basis used.) 

The symmetric part of a tensor T of type (2,0) is the tensor S(T) 
defined by rl 
S(T) (Ne) = 5 {Pn ne) + Tne )} 


for all y1,n.¢7'*,,. We shall denote the components S(T ofS) by 
To). then 1 
JXab) — 5 {Pad 4 Tea}. 


Similarly, the components of the skew-symmetric part of T will be 
denoted by 1 
qian — a {eh (peer 


In general, the components of the symmetric or antisymmetric part of 
a tensor on a given set of covariant or contravariant indices will be 
denoted by placing round or square brackets around the indices. Thus 


T, eset) 


Ayes a,) ; 


1 
= {sum over all permutations of the indices a, to OT ge. 


and 


1 4 
7 {alternating sum over all permutations of the indices 
ay to a, oo 
For example, 


_—isK > 
Koa = HE beg + Kane + K vay — K% yap — K Od 
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A tensor is symmetric in a given set of contravariant or covariant 
indices if it is equal to its symmetrized part on these indices, and is 
antisymmetric if it is equal to its antisymmetrized part. Thus, for 
example, a tensor T of type (0,2) is symmetric if 7, = 3(T.,+Tha)s 
(which we can also express in the form: 7],,) = 0). 

A particularly important subset of tensors is the set of tensors of 
type (0,¢) which are antisymmetric on all ¢ positions (so g < n); such 
a tensor is called a q-form. If A and B are p- and q-forms respectively, 
one can define a (p+q)-form A A B from them, where A is the skew- 
symmetrized tensor product ®; that is, A A B is the tensor of type 
(0, »+q) with components determined by 


(A A By tcf a Ag, Boy: 


This rule implies (A A B) = (—)?2(BA A). With this product, the 
space of forms (i.e. the space of all p-forms for all p, including one- 
forms and defining scalars as zero-forms) constitutes the Grassmann 
algebra of forms. If {E*%} is a basis of one-forms, then the forms 
E% ,... \ E% (a, run from 1 to 7) are a basis of p-forms, as any p-form 
A can be written A= A, ,E7/...AE®, where A, , = Ata...a)- 

So far, we have considered the set of tensors defined at a point on 
the manifold. A set of local coordinates {x*} on an open set Y in @ 
defines a basis {(@/éx*)|,,} of vectors and a basis {(dz*)|,,} of one-forms 
at each point p of Y, and so defines a basis of tensors of type (r,s) at 
each point of Y%. Such a basis of tensors will be called a coordinate 
basis. A C* tensor field Th of type (r,s)onaset¥ < .@isan assignment 
of an element of T7(p) to each point peV such that the components 
of T with respect to any coordinate basis defined on an open subset 
of ¥ are C* functions. 

In general one need not use a coordinate basis of tensors, i.e. given 
any basis of vectors {E,} and dual basis of forms {E*} on ¥, there will 
not necessarily exist any open set in Y on which there are local 
coordinates {x7} such that E, = 0/éx* and E* = dx*. However if one 
does use a coordinate basis, certain specializations will result; in parti- 
cular for any function f, the relations E,(L,f) = E,(E,f) are satisfied, 
being equivalent to the relations @f/éx%0x> = @f/éx> dx". If one 
changes from a coordinate basis E, = @/éx* to a coordinate basis 
E, = 0/éx”, applying (2.2), (2.3) to x7, x* shows that 


Ox shea. 


Clearly a general basis {E,} can be obtained from a coordinate basis 


2 HLS 
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{6/éx*\ by giving the functions Z,* which are the components of the E, 
with respect to the basis {0/dx*}; then (2.2) takes the form E, = E,'0/0x* 
and (2.3) takes the form E* = E%,dz*, where the matrix E%, is dual to 
the matrix E,'. 


2.3 Maps of manifolds 


In this section we define, via the general concept of a C* manifold map, 
the concepts of ‘imbedding’, ‘immersion’, and of associated tensor 
maps, the first two being useful later in the study of submanifolds, and 
the last playing an important role in studying the behaviour of 
families of curves as well as in studying symmetry properties of 
manifolds. 

Amap ¢ froma C* n-dimensional manifold_-# toa C*’ n’-dimensional 
manifold -#’ is said to be a C’ map (r < k, r < k’) if, for any local 
coordinate systems in W and -#@’, the coordinates of the image point 
¢(p) in @’ are C* functions of the coordinates of p in -#. As the map 
will in general be many—one rather than’ one-one (e.g. it cannot be 
one-one if n > n’), it will in general not have an inverse; and if a C’ 
map does have an inverse, this inverse will in general not be C” (e.g. 
if d is the map &!-> Rf} given by x+2, then $7! is not differentiable at 
the point x = 0). a 

Iffis a function on_-@’, the mapping ¢ defines the function $*fon.# 
as the function whose value at the point p of -# is the value of f at 


a $*f(p) = f(P(P)). (2.5) 
Thus when ¢ maps points from -@ to .@’, d* maps functions linearly 
from 4’ to M. 

If A(é) is a curve through the point pe.M, then the image curve 
P(A(t)) in M' passes through the point ¢(p). If r > 1, the tangent 
vector to this curve at ¢(p) will be denoted by ¢,(@/ét),| sp)s One can 
regard it as the image, under the map ¢, of the vector (2/ét) alp- Clearly 
$y is a linear map of T,(#) into T sp)(4'). From (2.5) and the defini- 
tion (2.1) of a vector as a directional derivative, the vector map on 
can be characterized by the relation: for each Ct (r > 1) function f at 
#(p) and vector X at p, 


X(O*f) lp = Sa X(S)| gn (2.6) 
Using the vector mapping ¢, from .@ to.@’, we can ifr > 1 define 


a linear one-form mapping ¢* from T* s(M@) to T*,(M) by the 
condition: vector-one-form contractions are to be preserved under the 
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maps. Then the one-form Ac7T* gp) 18 Mapped into the one-form 
p*AeT*, where, for arbitrary vectors X€¢ ae 


oA, XS = <A, Px X)| g¢- 


A consequence of this is that 


o*(df) = d(p*f). (2.7) 
The maps ¢, and ¢* can be extended to maps of contravariant 


tensors from .W to .@’ and covariant tensors from .#’ to .&# respec- 
tively, by the rules ¢,: TeTi(p) >¢,T €T%(d(p)) where for any 


(eT * .), 
TEN! DS, SM Y|p = be TOE Noon 
| o*: TeT%(d(p)) > 6*T eT (p), 
where for any X;€7,, 


asad MO. tees = IOP. Ge ® $x Xz) |} sq 


When r > 1, the C" map ¢ from -@ to .M’ is said to be of rank s at p 
if the dimension of 0,.(7',(4)) is s. It is said to be injective at pif s = n 
(and son < n’) at p; —_ no vector in J, is mapped to zero by ¢,,. It 
is said to i oe ifs = iy Ouest a 

A C" map ¢ (r ) is said to be an immersion if it and its inverse 
are C7 maps, i.e. a each point pe. there is a neighbourhood 
WU of p in .@ such that the inverse ¢~ restricted to 4(%) is also 

a C7? map. This implies n < n’. By the implicit function theorem 
oer (1965), p.41), when > 1, ¢ will be an immersion if and only if 
it is injective at every point pe.M; then ¢, is an isomorphism of 7, 
into the image 9,(T,) © Typ). The image ¢(-@) is then said to be an 
n-dimensional immersed submanifold in 4’. This submanifold may 
intersect itself, i.e. @ may not be a one-one map from -4@ to ¢(&# 
although it is one—one when restricted toa sufficiently small neighbour- 
hood of .#. An immersion is said to be an imbedding if it is a homeo- 
morphism onto its image in the induced topology. Thus an imbedding 
is a one-one immersion; however not all one-one immersions are 
imbeddings, cf. figure 6. A map ¢ is said to be a proper map if the 
inverse image ¢-(.%) of any compact set # < .#’ is compact. It can 
be shown that a proper one-one immersion is an imbedding. The 
image $(.@) of @ under an imbedding ¢ is said to be an n-dimensional 
imbedded submanifold of 4’. 

The map ¢ from 4 to @’ is said to be a C” diffeomorphism if it is 
a one-one C’ map and the inverse ¢—1 is a C? map from .@ to @. In 


eae 
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this case, n = n’, and ¢ is both injective and surjective if r > 1; con- 
versely, the implicit function theorem shows that if ¢, is both injective 
and surjective at p, then there is an open neighbourhood © of p such 
that 6: Y> $(%) isa diffeomorphism. Thus ¢ is a local diffeomorphism 
near p if d, is an isomorphism from 7, to Ty). 


FIGURE 6. A one-one immersion of #1 in FR? which is not an imbedding, obtained 
by joining smoothly part of the curve y = sin (1/x) to the curve 


{(y,0); —coo<y < 1}. . 


When the map ¢ is a C’ (r > 1) diffeomorphism, ¢, maps T,,(-@) to 
Typ’) and (g-*)* maps T*,(-#) to T* 4(4’). Thus we can define 
amap ¢, of T;(p) to T;(¢(p)) for any r, s, by 


ra Bree: Se eer, 8) 


= Pa P (G7), ---(B7) *8, ba Ris Se XP) sy 


for any X,¢7,,, n'¢7*,. This map of tensors of type (r, s) on @ to 
tensors of type (7, s) on @’ preserves symmetries and relations in the 
tensor algebra; e.g. the contraction of ¢,T is equal to ¢, (the con- 
traction of T). 


2.4 Exterior differentiation and the Lie derivative 

We shall study three differential operators on manifolds, the first two 
being defined purely by the manifold structure while the third is 
defined (see § 2.5) by placing extra structure on the manifold. 
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The exterior differentiation operator d maps r-form fields linearly to 
(r+ 1)-form fields. Acting on a zero-form field (i.e. a function) f, it 
gives the one-form field df defined by (cf. §2.2) 


<df, X> = Xf for all vector fields X (2.8) 
and acting on the r-form field 
A= Ay, gda*rndx® a... Adxt 
it gives the (r+ 1)-form field dA defined by 
dA =d4,, gAdx*rdza... adaxt. (2.9) 


To show that this (r + 1)-form field is independent of the coordinates 
{x*} used in its definition, consider another set of coordinates {a7}. 


Then A= Agygda® dx’ A... A de®, 
where the components A, are given by 


ae ox Gat 
id = Boa Ogb °° Dyed Hab... d- 


Thus the (r+ 1)-form dA defined by these coordinates is 
dA = dA vy... ¢ dae’ A dz” IM S50 1M da? 


A a’b’ 


ox ex = Gar . 

ix (Se a par Aw...) A da® Adz A... A dxt 
6xc® Ox? bat ve . 
= Fat Bal gk TAcn..a A da Ada” a... dxt 


027% Ox? axt 
Ox” Ox% Ox” Ont 
= dA, gAda*adza...adazt 


as 0%x%/dx% dx is symmetric in a’ and e’, but dz A dz” is skew. Note 
that this definition only works for forms; it would not be independent 
of the coordinates used if the A product were replaced by a tensor 
product. Using the relation d(fg) = gdf+fdg, which holds for arbi- 
trary functions f, g, it follows that for any r-form A and form B, 
d(A A B) = dAanB+(—)’AAGB. Since (2.8) implies that the local 
coordinate expression for df is df = (éf/éx*)dz*, it follows that 
d(df) = (@f/éx* dat) dx* n dx = 0, as the first term is symmetric and 
the second skew-symmetric. Similarly it follows from (2.9) that 


d(dA) = 0 


A, «dar nidat ndad a... a da®+...4... 


holds for any r-form field A. 
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The operator d commutes with manifold maps, in the sense: if 
b: M > M' isa Ct (r > 2) map and A is a C* (k > 2) form field on .#’, 
then (by (2.7)) 

d(g*A) = P*(dA) 
(which is equivalent to the chain rule for partial derivatives). 

The operator d occurs naturally in the general form of Stokes’ 
theorem on a manifold. We first define integration of n-forms: let 4 
be a compact, orientable n-dimensional manifold with boundary ¢# 
and let {f,} be a partition of unity for a finite oriented atlas {Y,, ¢,}. 
Then if A is an n-form field on 4, the integral of A over -@ is defined as 


I — ft Aine de® .. ar, (2.10) 
AM a J pal(Wa) 
where A,,_,, are the components of A with respect to the local co- 
ordinates in the coordinate neighbourhood Y,, and the integrals on 
the right-hand side are ordinary multiple integrals over open sets 
$,(%,) of k”. Thus integration of forms on .@ is defined by mapping 
the form, by local coordinates, into R” and performing standard 
multiple integrals there, the existence of the partition of unity 
ensuring the global validity of this operation. 

The integral (2.10) is well-defined, since if one chose another atlas 
{Y%, Wa} and partition of unity {g,} for this atlas, one Would obtain 
the integral 


xf Op Are... de! da® ... da”, 
BJ vas) 


where x* are the corresponding local coordinates. Comparing these 
two quantities in the overlap (W, 1 ¥;) of coordinate neighbourhoods 
belonging to two atlases, the first expression can be written 


La SaGp Are... n Ax'dz?... dz, 
a f J da(Wan¥p) 


and the second can be written 


> Bi — Se GEA 222 COE”... 
phan ¥ 8B) 


«4 
Comparing the transformation laws for the form A and the multiple 


integrals in R”, these expressioris are equal at each point, so Ais 
M 


independent of the atlas and partition of unity chosen. 
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Similarly, one can show that this integral is invariant under 
diffeomorphisms: 
[ A= [A 
A AM 


if @ is a CO’ diffeomorphism (r > 1) from .# to.#’. 
Using the operator d, the generalized Stokes’ theorem can now be 
written in the form: if B is an (n—1)-form field on .@, then 


| —_— { dB, 
OM A 


which can be verified (see e.g. Spivak (1965)) from the definitions 
above; it is essentially a general form of the fundamental theorem of 
calculus. To perform the integral on the left, one has to define an 
orientation on the boundary @.@ of M. This is done as follows: if Y, is 
a coordinate neighbourhood from the oriented atlas of W such that 
U,, intersects 0M, then from the definition of 0.4, ¢,(Y, 1 0M) lies in 
the plane x1 = 0 in R” and ¢,(Y, 0 -@) lies in the lower half 2! < 0. 
The coordinates (x?,z°,...,2") are then oriented coordinates in the 
neighbourhood Y, 9 eH of @M. It may be verified that this gives an 
oriented atlas on 2M. 

The other type of differentiation defined naturally by the manifold 
structure is Lie differentiation. Consider any C’ (r > 1) vector field X 
on @. By the fundamental theorem for systems of ordinary differential 
equations (Burkill (1956)) thereisa unique maximal curve A(t) through 
each point p of such that A(0) = p and whose tangent vector at the 
point A(t) is the vector X|,:). If {x*} are local coordinates, so that the 
curve A(t) has coordinates x*(t) and the vector X has components X?, 
then this curve is locally a solution of the set of differential equations 


dear = X4ar(i), ...52"@)). 


This curve is called the integral curve of X with initial point p. For each 
point q of @, there is an open neighbourhood Y of q and ane > Osuch 
that X defines a family of diffeomorphisms ¢,: @>.M whenever 
|t| < €, obtained by taking each point p in Y a parameter distance t 
along the integral curves of X (in fact, the ¢, form a one-parameter 
local group of diffeomorphisms, as ¢,,, = ¢,0¢, = 9,09, for 
él, |s|, [e+s| <€, so d_»=(¢,)1 and ¢y is the identity). This 
diffeomorphism maps each tensor field T at p of type (r,s) into 


Dix T | 54)- . 
The Lie derivative LyT of a tensor field T with respect to X is 
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defined to be minus the derivative with respect to ¢ of this family of 
tensor fields, evaluated at t = 0, i.e. 


Pe 
LxT|, = lim; {Tlp— bu TI p}- 


From the properties of ¢,, it follows that 

(1) Ly preserves tensor type, i.e. if T is a tensor field of type (r,s), 
then Ly T is also a tensor field of type (7, s); 

(2) Ly maps tensors linearly and preserves contractions. 

As in ordinary calculus, one can prove Leibniz’ rule: 

(3) For arbitrary tensors S, T, Ly(S © T) = LyS ®T+S @L,T. 

Direct from the definitions: 

4) Ly f = Xf, where f is any function. 

Under the map ¢,, the point ¢g = ¢_,(p) is mapped into p. Therefore 

P is a map from 7, to 7,,. Thus, by (2.6), 


(Dix ip ne Y($*f)Iq- 


If {x*} are local coordinates in a neighbourhood of p, the coordinate 
components of ¢,, Y at p are 


(dim Y)'|p = Sea Y|p2t = Yi — ig 5 (itp) 
_ (640) yy 


~~ Beilgy | : 
da($(q)) _ y; 
Now ee =X4l sia: 
d (éx'($,(q)) eet 
therefore di eal _ = Ga |, 5 
ss ey? one rj 
oa (Iq Y)é = 5 (bu ¥) Maco = Gey RSs, (2.11) 


ex) 
One can rewrite this in the form 
(Lx Y)f = X(Yf)— ¥(Xf) 
for all C? functions f. We shall sometimes denote Ly Y by [X, Y], i.e. 
LyY = —LyX = [X, Y] = —[Y, X]. 


If the Lie derivative of two vector fields X, Y vanishes, the vector 
fields are said to commute. In this case, if one starts at a point p, goes 
a parameter distance t along the integral curves of X and then a 
parameter distance s along the integral curves of Y, one arrives at the 
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same point as if one first went a distance s along the integral curves 
of Y and then a parameter distance t along the integral curves of X 
(see figure 7). Thus the set of all points which can be reached along 
integral curves of X and Y from a given point p will then form an 
immersed two-dimensional submanifold through p. 


LE bsv(Oix(p)) 
<= bex(dov(p)) 


FicuRE 7. The transformations generated by commuting vector fields X, Y 
move a point 7p to points ¢.z(p), d,y(p) respectively. By successive applications 
of these transformations, p is moved to the points of a two-surface. 


The components of the Lie derivative of a one-form w may be found 
by contracting the relation 
Lx(w @ Y) = Ly @ Y+0 @ LY 
(Lie derivative property (3)) to obtain 


Lg w, Y) = (Lge, ¥) + (wo, LxY) 

(by property (2) of Lie derivatives), where X, Y are arbitrary C1 
vector fields, and then choosing Y as a basis vector E,. One finds the 
coordinate components (on choosing E, = @/@x*) to be 
because (2.11) implies hs 

(Ly(6/éx*))i = — 0X62. 
Similarly, one can find the components of the Lie derivative of any 
Cr (r > 1) tensor field T of type (r,s) by using Leibniz’ rule on 


Ly(T @E*®...@E4@E,®...@E,), 
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and then contracting on all positions. One finds the coordinate com- 
ponents to be 


(Lg Te =-4 as = Megat es) eG = LSS ae OX/aaxt 


— (all upper indices) + 71-4, 0X*/dx? + (all lower oe cs 


Because of (2.7), any Lie derivative commutes with d, i.e. for any 
p-form field w, dance acs). 


From these formulae, as well as from the geometrical interpretation, 
it follows that the Lie derivative LyT|, of a tensor field T of type 
(r,s) depends not only on the direction of the vector field X at the 
point p, but also on the direction of X at neighbouring points. Thus 
the two differential operators defined by the manifold structure are 
too limited to serve as the generalization of the concept ofa partial 
derivative one needs in order to set up field equations for physical 
quantities on the manifold; d operates only on forms, while the 
ordinary partial derivative is a directional derivative depending only 
on a direction at the point in question, unlike the Lie derivative. One 
obtains such a generalized derivative, the covariant derivative, by 
introducing extra structure on the manifold. We do this in the next 


section. ~ 


2.5 Covariant differentiation and the curvature tensor 


The extra structure we introduce is a (affine) connection on .#. 
A connection V at a point p of is a rule which assigns to each vector 
field X at p a differential operator Vz which maps an arbitrary 
C" (r > 1) vector field Y into a vector field Vz Y, where: 


(1) VY is a tensor in the argument X, i.e. for any functions f, g, 
and C! vector fields X, Y, Z, 


VixtoyZ =fVxZ+9VyZ; 


(this is equivalent to the requirement that the derivative Vx at 7p 
depends only on the direction of X at p); 
(2) Vx ¥ is linear in Y, i.e. for any C! vector fields Y, Z and a, Pe R?, 


(3) for any C1 function f and C! vector field Y, 
VxG¥) = Xf) ¥ +7 vay. 
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Then VY is the covariant derivative (with respect to V) of Y in the 
direction X at p. By (1), we can define VY, the covariant derivative of Y, 
as that tensor field of type (1,1) which, when contracted with X, 
produces the vector V; Y. Then we have 

(3)>V(fY)=df@Y+fVY. 


A C” connection V on a C* manifold .W (k > r+2) is a rule which 
assigns a connection V to each point such that if Y is a C’+! vector field 
on -#, then VY is a C’ tensor field. 

Given any C’*! vector basis {E,} and dual one-form basis {E?} on 
a neighbourhood %, we shall write the components of VY as Y eee 


VY = Y*,,E°@E,. 
The connection is determined on Y by n° C* functions [™,, defined by 
D4, = (E*, Vz,E,) <> VE, = ©, E°@E,. 
For any C' vector field Y, 
VY = V(Y°E,) =d ¥Y°@E,+ YT4,, E°OE,- 
Thus the components of VY with respect to coordinate bases {0/dx%, 
{dx} are Y4., = 0¥¢/é0+T4,, Ye. 
The transformation properties of the functions ['¢,, are determined by 
connection properties ( 1), (2)2 (3); for 
Pye = (E*, Vay Ep) = (0%, E*, Voyon,(Po°E,)) 

= 077 0," E,( 9,7) + O° P%,.) 
ifE, = ®,4E,, E* = 6”, E*. One can rewrite this as 

Pee ee OP eee Df 14): 
In particular, if the bases are coordinate bases defined by coordinates 
{x7}, {x}, the transformation law is 
= Coke Ox” r*,} 


a” Sg.) ee a ig ee 
ye = Ox® \ex" Ba? * One Oxe’ 


Because of the term E,(®,“), the 14, do not transform as the compo- 
nents of a tensor. However if VY and VY are covariant derivatives 
obtained from two different connections, then 

VY-VY = (I,,—[4,,) Y°E°@E, 
will be a tensor. Thus the difference terms (I°%,,— f,,) will be the 
components of a tensor. 
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The definition of a covariant derivative can be extended to any 
Cr tensor field if r > 1 by the rules (cf. the Lie derivative rules): 

(1) if T is a C* tensor field of type (q, s), then VT isa C’—! tensor field 
of type (q, 8+ 1); 

(2) V is linear and commutes with contractions; 

(3) for arbitrary tensor fields S, T, Liebniz’ rule holds, 1.e. 


V(S@T) = VS@T+S@VT; 
(4) Vf = df for any function f. 
We write the components of VT as (Vg,7')*""%e...9 = TOE spp NG 
a consequence of (2) and (3), 


Vg, EC a ee Oe 
where {E%} is the dual basis to {E,}, and methods similar to those used 
in deriving (2.12) show that the coordinate components of VT are 


[Odd og = OTM2, [BaP +18, TH dy 
+ (all upper indices) —I¥,,7'%-4;,,_g— (all lower indices). (2.13) 


As a particular example, the unit tensor E,@E*, which has compo- 
nents 6%,, has vanishing covariant derivative, and so the generalized 
unit tensors with components 6%, d%,,...d4),,, dla, d%,. ... OP, 
(p < n) also have vanishing covariant derivatives. 

If T is a C* (r > 1) tensor field defined along a C’ curve AQ), one can 
define DT /ét, the covariant derivative of T along A(t), as Vajae T where T 
is any C* tensor field extending T onto an open neighbourhood of A. 
DT/éat is a C*-1 tensor field defined along A(t), and is independent of 
the extension T. In terms of components, if X is the tangent vector 
to A(t), then DT*--4, ,/ét = T--4,_,,, X”. In particular one can choose 
local coordinates so that A(t) has the coordinates x(t), X* = dx%/dt, 
and then for a vector field Y 

DY =cY*/o-- "7 ae jae. (2.14) 

The tensor T is said to be paraillelly transported along A if DT /et = 0. 
Given a curve A(t) with endpoints p, q, the theory of solutions of 
ordinary differential equations shows that if the connection V is at 
least C!- one obtains a unique tensor at qg by parallelly transferring 
any given tensor from p along A. Thus parallel transfer along A is a 
linear map from 7'%(p) to T’(q) which preserves all tensor products and 
tensor contractions, so in particular if one parallelly transfers a basis 
of vectors along a given curve from p to q, this determines an iso- 
morphism of 7, to 77. (If there are self-intersections in the curve, 
p and g could be the same point.) 
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A particular case is obtained by considering the covariant deriva- 
tive of the tangent vector itself along A. The curve A(t) is said to be 


a geodesic curve if 
$n DO 
at \ at), 


is parallel to (0/ét),, i.e. if there is a function f (perhaps zero) such that 
X*,,X° = fX*. For such a curve, one can find a new parameter v(t) 
along the curve such that 


such a parameter is called an affine parameter. The associated tangent 
vector V = (é/év), is parallel to X but has its scale determined by 
V(v) = 1; it obeys the equations 
28 b Cc 

Ve. (eee Neh ee ae) 
the second expression being the local coordinate expression obtainable 
from (2.14) applied to the vector V. The affine parameter of a geodesic 
curve is determined up to an additive and a multiplicative constant, 
i.e. up to transformations v’ = av+b where a, b are constants; the 
freedom of choice of 6 corresponds to the freedom to choose a new 
initial point A(0), the freedom of choice in a corresponding to the 
freedom to renormalize the vector V by a constant scale factor, 
V’ = (1/a) V. The curve parametrized by any of these affine parameters 
is said to be a geodesic’ 

Given a C” (r > 0) connection, the standard existence theorems for 
ordinary differential equations applied to (2.15) show that for any 
point p of @ and any vector X, at p, there exists a maximal geodesic 
Ax(v) in 4 with starting point p and initial direction X,,, i.e. such that 
Ax(0) = p and (4/@v),|,-9 = X,. Ifr > 1—, this geodesic is unique and 
depends continuously on p and X,,. Ifr > 1, it depends differentiably 
on p and X,. This means that if r > 1, one can define a C’™ map exp: 
T,, > 4, where for each Xe T,,, exp (X) is the point in # a unit para- 
meter distance along the geodesic Ay from p. This map may not be 
defined for all X€7,,, since the geodesic Ax(v) may not be defined for 
all v. If v does take all values, the geodesic A(v) will be said to be a 
complete geodesic. The manifold -@ is said to be geodesically complete 
if all geodesics on .@ are complete, that is if exp is defined on all T, for 
every point p of 4. 

Whether 4 is complete or not, the map exp, is of rank n at p. There- 
fore by the implicit function theorem (Spivak (1965)) there exists an 


(2.15) 
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open neighbourhood -% of the origin in 7, and an open neighbourhood 
AN, of p in M such that the map exp is a C’ diffeomorphism of -V 
onto ¥,. Such a neighbourhood -%, is called a normal neighbourhood 
of p. Further, one can choose WV, to be convex, i.e. to be such that any 
point q of VW, can be joined to any other point 7 in 7, by a unique 
geodesic starting at g and totally contained in ,. Within a convex 
normal neighbourhood WY one can define coordinates (zx1,...,%”) by 
choosing any point gé.V, choosing a basis {E,} of T,,, and defining the 
coordinates of the point r in.V by the relation r = exp (x*E,) (i.e. one 
assigns to r the coordinates, with respect to the basis {E,}, of the point 
exp~l(r) in T,.) Then (@/éz*)|, = E, and (by (2.15)) Pj], = 0. Such 
coordinates will be called normal coordinates based on q. The existence 
of normal neighbourhoods has been used by Geroch (1968c) to prove 
that a connected C? Hausdorff manifold 4 with a C! connection has 
a countable basis. Thus one may infer the property of paracompactness 
of a C? manifold from the existence of a C1 connection on the manifold. 
The ‘normal’ local behaviour of geodesics in these neighbourhoods is 
in contrast to the behaviour of geodesics in the large in a general space, 
where on the one hand two arbitrary points cannot in general be 
joined by any geodesic, and on the other hand some of the geodesics 
through one point may converge to ‘focus’ at some other point. We 
shall later encounter examples of both types of behaviour. 

Given a C* connection V, one can define a C’-! tensor field T of 
type (1, 2) by the relation 


Tie Sey vee eee 


where X, Y are arbitrary C’ vector fields. This tensor is called the 
torston tensor. Using a coordinate basis, its components are 
T*, = M,-T4,;. 

We shall deal only with torsion-free connections, i.e. we shall assume 
T = 0. In this case, the coordinate components of the connection obey 
I, = I,;, 80 such a connection is often called a symmetric connec- 
tion. A connection is torsion-free if and only if f, ij =J.;; for all func- 
tions f. From the geodesic equation (2.15) it follows that a torsion-free 
connection is completely determined by a knowledge of the geodesics 
on &. 

When the torsion vanishes, the covariant derivatives of arbitrary C1 
vector fields X, Y are related to their Lie derivative by 


[X, ¥] = Vg¥—VyX<>(LgY)* = ¥o,, X_N, 72, (2,16) 
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and for any C! tensor field T of type (r,s) one finds 
(LgT)®-4, 7 = Tt4, Xe Td) Xa 
— (all upper indices) + 7-¢,,. | X?,,+ (all lower indices). (2.17) 


One can also easily verify that the exterior derivative is related to the 
covariant derivative by 


dA = Aq gue Ada? A ... A dar< (dA) ea cas aa Ata ee ab 


where A is any p-form. Thus equations involving the exterior deriva- 
tive or Lie derivative can always be expressed in terms of the co- 
variant derivative. However, because of their definitions, the Lie 
derivative and exterior derivative are independent of the connection. 

If one starts from a given point p and parallelly transfers a vector 
X,, along a curve y that ends at p again, one will obtain a vector X’, 
which is in general different from X,,; if one chooses a different curve 
y’, the new vector one obtains at p will in general be different from 
X, and X’,,. This non-integrability of parallel transfer corresponds to 
the fact that the covariant derivatives do not generally commute. The 
Riemann (curvature) tensor gives a measure of this non-commutation. 
Given C’+1 vector fields X, Y, Z, a C’-! vector field R(X, Y) Z is defined 
by a C” connection V by 


Then R(X, Y)Z is lingar“in X, Y,Z and it may be verified that the 
value of R(X, Y) Z at p depends only on the values of X, Y, Z at p, i.e, 
it is a C’-1 tensor field of type (3,1). To write (2.18) in component 
form, we define the second covariant derivative VVZ of the vector Z 
as the covariant derivative V(VZ) of VZ; it has components 
Z* 7 (2°. 5); 
Then (2.18) can be written 
Began? = (Ze. 4Y%), Xo —(2*, aX"), nae 
So Zt. al re, re — ¢ 63) 
= (74. ee aa) 7D Bo 
where the Riemann tensor components R%,,., with respect to dual 
bases {E,}, {E%} are defined by R%,,¢ = (E*, R(E,, Ez) E,). As X, ¥ are 
arbitrary vectors, ZA, ag —Z, 0g = BR j0q Z (2.19) 
expresses the non-commutation of second covariant derivatives of Z 
in terms of the Riemann tensor. 
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Since 
Vz(n @Vy Z) = Ven@VyZ+n@VxVyZ 
=> <n, VzVyZ) = X((n, Vy Z)) — (Van, Vy Zz» 

holds for any C? one-form field n and vector fields X, Y, Z, (2.18) 
implies 

(E*, R(E,, Ez) E,) = E,({E*, Ve, E,)) — Za(<E*, Ve, Bo>) 

— (Vg, E*, Vz, Ep) + (Vz, E*; Ve, Ey) — (E* Vin, Eo). 
Choosing the bases as coordinate bases, one finds the expression 
RY yeq = OF gy/ Bx? — OP [0x4 + PP %an—Teag lly (2-20) 


for the coordinate components of the Riemann tensor, in terms of the 
coordinate components of the connection. 

It can be verified from these definitions that in addition to the 
egoe! RG 0q = — RB yae<> Rye = 9 (2.21a) 
the curvature tensor has the symmetry 

RY hoa = 0<> R% 0g + Rape + R% cay = 0. (2.21b) 


Similarly the first covariant derivatives of the Riemann tensor satisfy 
Bianchi’s wdentitres 
- 
Rca; = 0<> R%yea; e+ Prec; at Rode; e = 9. (2.22) 
It now turns out that parallel transfer of an arbitrary vector along 
an arbitrary closed curve is locally integrable (i.e. X’,, is necessarily the 
same as X,, for each pe.) only if R%,,q = 0 at all points of .@; in this 
case we say that the connection is flat. 


By contracting the curvature tensor, one can define the Ricci tensor 
as the tensor of type (0, 2) with components 


Boa = Baa: 
2.6 The metric 


A metric tensor § at a point pe.M is a symmetric tensor of type (0, 2) 
at p, so aC’ metric on .@ is a C” symmetric tensor field §. The metric § 
at p assigns a ‘magnitude’ (|g(X, X)|)# to each vector Xe7, and 
defines the ‘cos angle’ 

‘ g(X,Y) 


(g(X, X) .g(¥, Y)|)# 


between any vectors X, Ye 7, such that g(X, X).g(Y, Y) + 0; vectors 
X, Y will be said to be orthogonal if g(X, Y) = 0. 
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The components of g with respect to a basis {E,} are 


Jad = g(E,, E,) = g(E,, E,); 


i.e. the components are simply the scalar products of the basis vectors 
E,. If a coordinate basis {2/227} is used, then 


& =9,,dx*@dz», (2.23) 


Tangent space magnitudes defined by the metric are related to 
magnitudes on the manifold by the definition: the path length between 
points p = y(a) and g = y(b) along a C°, piecewise C} curve y(t) with 
tangent vector ¢/ét such that g(é/0t, 2/0t) has the same sign at all points 
along y(t), is the quantity 


Ee | ” (|g(a/at, a/at)| Edt. (2.24) 


We may symbolically express the relations (2.23) ,(2.24) in the form 
ds? = g,,dx* dx! 


used in classical textbooks to represent the length of the ‘infinitesimal’ 
are determined by the coordinate displacement xt > x‘ + dat, 

The metric is said to be non-degenerate at p if there is no non-zero 
vector Xe7T, such that 9(X, Y) = 0 for all vectors Ye 7. In terms of 
components, the metric is non-degenerate if the matrix (g,,,) of com- 
ponents of g is non-singular, We shall from now on always assume the 
metric tensor is non-degenerate. Then we can define a unique sym- 
metric tensor of type (2, 0) with components g” with respect to the 
basis {E,} dual to the basis {E%}, by the relations 

oe Dae a oe 
i.e. the matrix (g*°) of components is the inverse of the matrix (9,,). 
It follows that the matrix (g?) is also non-singular, so the tensors 
g, Jan can be used to give an isomorphism between any covariant 
tensor argument and any contravariant argument, or to ‘raise and 
lower indices’. Thus, if X¢% are the components of a contravariant 
vector, then X,, are the components of a uniquely associated covariant 
vector, where X, = 9,,4°, X% = g@X,; similarly, to a tensor 7), of 
type (0,2) we can associate unique tensors 7%, = 9%, T,? = g"°T.,-; 
JT — ggbtT',, We shall in general regard such associated covariant 
and contravariant tensors as representations of the same geometric 
object (so in particular, g,,, 6,°and g® may be thought of as representa- 
tions (with respect to dual bases) of the same geometric object §), 
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although in some cases where we have more than one metric we shall 
have to distinguish carefully which metric is used to raise or lower 
indices. 

The signature of § at p is the number of positive eigenvalues of the 
matrix (g,,) at p, minus the number of negative ones. If § is non- 
degenerate and continuous, the signature will be constant on .@; by 
suitable choice of the basis {E,}, the metric components can at any 
point p be brought to the form 

Io = diag(+1, +1,..., +1, —1,..., —1), 
Ne ee 
4(n+s)terms 4(n—s) terms 
where s is the signature of § and n is the dimension of .4. In this case 
the basis vectors {E,} form an orthonormal set at p, i.e. each is a unit 
vector orthogonal to every other basis vector. 


A metric whose signature is n is called a positive definite metric; for 
such a metric, g(X, X) = 0 > X = 0, and the canonical form is 


G5, — Giag( Fi, ..2.b 1), 
———_Leie 
n terms 


A positive definite metric is a ‘metric’ on the space, in the topological 
sense of the word. 

A metric whose signature is (n— 2) is called a Lorentz metric; the 
canonical form is 2 

Jap = diag (+1,..., +1, —1). 
—— ee 
(n—1)terms 

With a Lorentz metric on .@, the non-zero vectors at p can be divided 
into three classes: a vector XeT,, being said to be timelike, null, or 
spacelike according to whether g(X, X) is negative, zero, or positive, 
respectively. The null vectors form a double cone in T,, which separates 
the timelike from the spacelike vectors (see figure 8). If X, Y are any 
two non-spacelike (i.e. timelike or null) vectors in the same half of the 
light cone at p, then g(X, Y) < 0, and equality can only hold if X and 
Y are parallel null vectors (i.e. if X = «Y, g(X, X) = 0). 

Any paracompact C" manifold admits a C’— positive definite metric 
(that is, one defined on the whole of .). To see this, let {f,} be a parti- 
tion of unity for a locally finite atlas {Y,, ¢,}. Then one can define g by 


g(X, ¥) am UhaC(Pa) x (P2)% Y); 


where ¢ ,  ) is the natural scalar product in Euclidean space R”; 
thus one uses the atlas to determine the metric by mapping the 
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Null cone 


Null vectors lie 
7 on the null cones 


Timelike vectors 
lie inside the 
null cones 


Spacelike vectors lie 
outside the null cones 
E 


nm-1 


Hyperplane spanned by 
1 Brig coe en 


Null cone 


Ficure 8. The null cones defined by a Lorentz metric. 


Buclidean metric into -@, This is clearly not invariant under change of 
atlas, so there are many such positive definite metrics on 4. 

In contrast to this, a C’ paracompact manifold admits a Cr-} 
Lorentz metric if and only if it admits a non-vanishing C’—! line 
element field; by a line element field is meant an assignment of a pair 
of equal and opposite vectors (K, — X) at each point p of 4, i.e. a line 
element field is like a vector field but with undetermined sign. To see 
this, let & be a C’—! positive definite metric defined on the manifold. 
Then one can define a Lorentz metric § by 
g(X, Y)9(X, Z) 

9(X, X) 
at each point p, where X is one of the pair (X, — X) at p. (Note that as 
X appears an even number of times, it does not matter whether X or 
—X is chosen.) Then g(X, X) = —9(X, X), and if Y, Z are orthogonal 
to X with respect to 8, they are also orthogonal to X with respect to 
g and 9(Y, Z) = 9(Y, Z). Thus an orthonormal basis for § is also an 
orthonormal basis for 8. As & is not unique, there are in fact many 


g(Y, Z) = 60,2) 2 
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Lorentz metrics on -@ if there is one. Conversely, if $ is a given 
Lorentz metric, consider the equation g,, X° = Ag,,X° where § is any 
positive definite metric. This will have one negative and (n-—1) 
positive eigenvalues. Thus the eigenvector field X corresponding to 
the negative eigenvalue will locally be a vector field determined up to 
a sign and a normalizing factor; one can normalize it by g,,X°X° = — 1, 
so defining a line element field on -4@. 

In fact, any non-compact manifold admits a line element field, 
while a compact manifold does so if and only if its Euler invariant is 
zero (e.g. the torus T? does, but the sphere S* does not, admit a line 
element field). It will later turn out that a manifold can be a reasonable 
model of space-time only if it is non-compact, so there will exist many 
Lorentz metrics on 4. 

So far, the metric tensor and connection have been introduced as 
separate structures on .#. However given a metric § on -M@, there is 
a unique torsion-free connection on -@ defined by the condition: the 
covariant derivative of ¢ is zero, i.e. 

Jab; c = 0. ; ; (2.25) 
With this connection, parallel transfer of vectors preserves scalar 
products defined by g, so in particular magnitudes of vectors are 
invariant. For example if @/¢t is the tangent vector to a geodesic, then 
g(0/ét, 2/et) is constant along the geodesic. ie 

From (2.25) it follows that 


X(9(Y, Z)) = V(9(¥, Z)) = Vx9(¥, Z)+9(VxY, Z) 
+9(¥, Vg Z) = 9(VzY, Z)+9(Y, VzZ) 
holds for arbitrary C1 vector fields X, Y, Z. Adding the similar expres- 
sion for Y(9(Z, X)) and subtracting that for Z(g(X, Y)) shows 


g(Z, Vx Y) = 3{-Z(g(X, Y)) + Y(g(Z, X)) + X(9(Y, Z)) 
Choosing X, Y, Z as basis vectors, one obtains the connection 
Cee ae = g(E,, Vz, E,) = Jaa De 


in terms of the derivatives of the metric components Jan = G(E,, E,), 
and the Lie derivatives of the basis vectors. In particular, on using 
a coordinate basis these Lie derivatives vanish, so one obtains the 
usual Christoffel relations 


ee = {09 an/ 02° = 29 ae] Ox? _ 29 5o/ 02%} (2.26) 
for the coordinate components of the connection. 
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From now on we will assume that the connection on -@ is the unique 
C*—! torsion-free connection determined by the C’ metric g. Using this 
connection, one can define normal coordinates (§2.5) in a neighbour- 
hood of a point g using an orthonormal basis of vectors at q. In these 
coordinates the components g,, of at q will be +6,, and the compo- 
nents I¢,, of the connection will vanish at g. By ‘normal coordinates’, 
we shall in future mean normal coordinates defined using an ortho- 
normal basis. 

The Riemann tensor of the connection defined by the metric is a 
C’- tensor with the symmetry 


Riavrea = 0 Rarea = Sis (2.27a) 


in addition to the symmetries (2.21); as a consequence of (2.21) and 
(2.27a), the Riemann tensor is also symmetric in the pairs of indices 


{ab}, {cd}, i.e. oe (2.276) 


This implies that the Ricci tensor is symmetric: 
Jia (2.27c) 
The curvature scalar R is the contraction of the Ricci tensor: 
B= Bo, = Pj.a9°4 


With these symmetries, there are {yn?(n”— 1) algebraically inde- 
pendent components of R,,,,g, where n is the dimension of M; 3n(n + 1) 
of them can be represented by the components of the Ricci tensor. If 
n= 1, Rireg = 0; if n = 2 there is one independent component of 
Rineas Which is essentially the function R. If n = 3, the Ricci tensor 
completely determines the curvature tensor; if n > 3, the remaining 
components of the curvature tensor can be represented by the Weyl 
tensor C4, defined by 


2 F 2 
Coe — Raven t ao {Gata Rav + Joie Raa} G1) (n—2) FG atcIaiv: 


n—2 


As the last two terms on the right-hand side have the curvature tensor 
symmetries (2.21), (2.27), it follows that C,,,¢ also has these sym- 
metries. One can easily verify that in addition, 


C%aa = 9, 


i.e. one can think of the Weyl tensor as that part of the curvature 
tensor such that all contractions vanish. 
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An alternative characterization of the Wey] tensor is given by the 
fact that it is a conformal invariant. The metrics g and 8 are said to 


be conformal if 8 =a% (2.28) 
for some non-zero suitably differentiable function Q. Then for any 
vectors X, Y, V, W at a point p, 

g(X,¥) _ 9(X.¥) 

g(V,W) 9(V, W)’ 
so angles and ratios of magnitudes are preserved under conformal 


transformations; in particular, the null cone structure in 7; is pre- 
served by conformal transformations, since 


gix,X)> 0, = 0, < O= 9X X) > 0] Oo =< 0; 


respectively. As the metric components are related by 
Pan = gan, GP = Q-*g”, 


the coordinate components of the connections defined by the metrics 
(2.28) are related by 


Ba, = Pe. 27 (54.55 + 005 = 


eQ 
bay cat Oa 5 = 99°53) (2.29) 


Calculating the Riemann tensor of §, one finds 
a = Q-? RY 4 a alr On, 
where 09,:= 40-071), g% — 2(Q-4), (Q-), 29°404,; 


~ 


the covariant derivatives in this equation are those determined by the 
metric g. Then (assuming 7 > 2) 


R, = 0-2R, + (n— 2)O-1(0-4), 4.9% — (n — 2)-2.0- n(Ot=2\ 90 
and com = Ones 


the last equation expressing the fact that the Weyl tensor is con- 
formally invariant. These relations imply 


R = Q-*R—2(n—- 1) 2-80, gg — (n= 1) (n— 4) Q, .O. 49°. (2.30) 


Having split the Riemann tensor into a part represented by the 
Ricci tensor and a part represented by the Weyl tensor, one can use 
the Bianchi-identities (2.22) to obtain differential relations between 
the Ricci tensor and the Wey] tensor: contracting (2.22) one obtains 


R430 = Fngse— Meg (2.31) 
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and contracting again one obtains 
ies = ah... 


From the definition of the Weyl tensor, one can (if n > 3) rewrite 
(2.31) in the form 
n—3 1 
CO Kaa = 205 (Rao sp—yMeB 0d): (2.32) 


If n < 4, (2.31) contain all the information in the Bianchi identities 
(2.22), so if n = 4, (2.32) are equivalent to these identities. 

A diffeomorphism 9: #—-.@ will be said to be an isometry if it 
carries the metric into itself, that is, if the mapped metric ¢, § is equal 
to g at every point. Then the map ¢,:7,—>Ty,) preserves scalar 
products, as 


G(X, Y) |p = Ox 9( Ga ¥, x Y)/ 4 = (Pa X OxY)| gp 
If the local one-parameter group of diffeomorphisms ¢, generated 
by a vector field K is a group of isometries (i.e. for each ¢, the trans- 
formation ¢, is an isometry) we call the vector field K a Killing vector 
field. The Lie derivative of the metric with respect to K is 
oo 
[gg = lim (8-918) iN), 
since § = ¢,4 for! each 4 But from (2.17), Lgga, = 2Kia;y; 80 
Killing vector field K satisfies Killing’s equation 
Kop k = 0. (2.33) 


Conversely, if K is a vector field which satisfies Killing’s equation, 
then [g8 = 0, so 


tq ; 
Pix Bly = Blt | ap GraB)l dé 
td ; 
= gl +| s,¢0 «Ps *&)s-0 | ,at 
t d 
= al+ |. (Gra zz%o28) 


t 
— él, | dy * (Lg8| 5_ ip) dt’ = ol 


dt’ 
Dp 


Thus K is a Killing vector field if and only if it satisfies Killing’s equa- 
tion. Then one can locally choose coordinates x* = (z’, t)(v = 1ton—1) 
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such that K* = éx%/ét = 6%,; in these coordinates Killing’s equation 
Lae ene 29 an] a = 094 = Jar(X”) : 

A general space will not have any symmetries, and so will not admit 
any Killing vector fields. However a special space may admit r 
linearly independent Killing vector fields K, (a = 1,...,7). It can be 
shown that the set of all Killing vector fields on such a space forms a 
Lie algebra of dimension 7 over R, with the algebra product given by 
the Lie bracket [ , ] (see (2.16)), where 0 <r < 4n(n+1). (The 
upper limit may be lessened if the metric is degenerate.) The local 
group of diffeomorphisms generated by these vector fields is an 
r-dimensional Lie group of isometries of the manifold .@. The full 
group of isometries of .@ may include some discrete isometries (such 
as reflections in a plane) which are not generated by Killing vector 
fields; the symmetry properties of the space are completely charac- 
terized by this full group of isometries. 


2.7 Hypersurfaces 


If S is an (n — 1)-dimensional manifold and 6: 7+ is an imbedding, 
the image 0(/) of # is said to be a hypersurface in W. If pe FY, the 
image of T, in T,) under the map 6, will be a (n — 1)-dimengional plane 
through the origin. Thus there will be some non-zero form nels 
such that for any vector Xe7,, (n, 0, X) = 0. The form n is unique 
up to a sign and a normalizing factor, and if 0(.) is given locally by 
the equation f = 0 where df + 0 then n may be taken locally as df. 
If 6(.S/) is two-sided in -@, one can choose n to be a nowhere zero 
one-form field on 6(S). This will be the situation if Y and .# are both 
orientable manifolds. In this case, the choice of a direction of n will 
relate the orientations of 6(Y) and of -M: if {z*\ are local coordinates 
from the oriented atlas of W such that locally 6() has the equation 
xz! = 0 and n = adz! where « > 0, then (x7, ...,2”) are oriented local 
coordinates for 6(.S). 

If g is a metric on 4, the imbedding will induce a metric 6*§ on F&, 
where’ if YET, OBEY) pag(Ge Xie, Y)|a- This metric is 
sometimes called the first fundamental form of ¥. If ¢ is positive 
definite the metric 6*g will be positive definite, while if £ is Lorentz, 
6*g will be 

(a) Lorentz if g*n,n, > 0 (in this case, O(S) will be said to be a 
tumeltke hypersurface), 
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(b) degenerate if g*n,n, = 0 (in this case, O(/) will be said to be a 
null hypersurface), 

(c) positive definite if gn, n, < 0 (in this case, 6(S) will be said to 
be a spacelike hypersurface). 

To see this, consider the vector N® = n,g%. This will be orthogonal 
to all the vectors tangent to 6(), i.e. to all vectors in the subspace 
H = 0,(T,) in Ty). Suppose first that N does not itself lie in this 
subspace. Then if (Ey, ..., E,,) are a basis for T,,, (N, 0,(E.), ..., 04(E,)) 
will be linearly independent and so will be a basis for J/,. The compo- 
nents of § with respect to this basis will be 

= (a 0 - Gee 0 

a 0 g(94(E,), 6,(E;)) - 0 Man) 
As the metric § is assumed to be non-degenerate, this shows that 
g(N, N) + 0. If g is positive definite, g(N, N) must be positive and so 
the induced metric 6*g must also be positive definite. If g is Lorentz 
and g(N, N) = g%n,”, < 0, then 6*§ must be positive definite since 
the matrix of the components of § has only one negative eigenvalue. 
Similarly if g(N, N) = 9%n,7, > 0, then 0*g will be a Lorentz metric. 
Now suppose that N is tangent to 6(.7). Then there is some non-zero 
vector Xe7,, such that 6,(X) = N. But g(N,6,Y) = 0 for all Ye7,, 
which implies 6* g(X, Y) = 0. Thus 6*g is degenerate. Also, taking 
Y¥aowhe X, giN, N)i=— 777,77, = 0. 

If gn, 7, + 0, one can normalize the normal form n to have unit 
magnitude, i.e. g%’n,n, = +1. In this case the map 6*:T*,,,>T*, 
will be one-one on the (n—1)-dimensional subspace H*,,) of T’*4,) 
consisting of all forms w at 6(p) such that g%n,w, = 0, because 
6*n = 0 and n does not lie in H*. Therefore the inverse (0*)-4 will be 
a map 6, of 7'*, onto H*,,), and so into T'*y,). 

This map can be extended in the usual way to a map of covariant 
tensors on Y to covariant tensors on 6(./) in -@; as there already is 
a map 0,, of contravariant tensors on / to 6(/), one can extend 6, to 
a map 6, of arbitrary tensors on F to 6(Y). This map has the property 
that 6,7 has zero contraction with n on all indices, i.e. 

CHU Aa =0 and (Cy, go = 0 
for any tensor Te 7%(-S). 

The tensor h on 6(S) is defined by h = 6,(0*g). In terms of the 
normalized form n (remember gn,” = + 1), 

hap = Jap + NaN 


since this implies 0*h = 6*g and h,,g*°n, = 9. 
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The tensor h?, = 9h,» is a projection operator, ie. h%,h°, = h%,. It 
projects a vector X € Tj, into its part lying in the subspace H =,(T5) 
of Typ, tangent to O(S), 


Xa = ha, X°+ nn, X®, 


where the second term represents the part of X orthogonal to (7). 
Also h4, projects a form we7'*,,) into its part lying in the subspace 
Bo)’ Og = h?, 0, + NN Ws. 

Similarly one can project any tensor T €7';(@(p)) into its part in 


H°(9(p)) = Hyg) ®--.@ Hyg) @ Hj q)®-.. OH gy, 
ee 


r factors s factors 


i.e. its part which is orthogonal to n on all indices. 

The map 6, is one-one from 7), to Hy,). Therefore one can define 
a map 6* from Ty,) to T,, by first projecting with h%, into H,,) and then 
using the inverse (@),1. As one already has a map 6* of forms on 6(.7) 
to forms on , one can extend the definition of 6* to a map O* of 
tensors of any type on 6(.7) to tensors on .Y. This map has the property 
that 6*(6,T) = T for any tensor Te 7"(p) and 6,(6*T) = T for any 
tensor T €H%(0(p)). We shall identify tensors on with tensors in 
H™ on 6(F) if they correspond under the maps 6,, 6*. Ig particular, 
h can then be regarded as the induced metric on 0(7). 

If iis any extension of the unit normal n onto an open neighbour- 
hood of 6(/) then the tensor y defined on 0(Y) by 


= da 
Xab = heh be; a 


is called the second fundamental form of /. It is independent of the 
extension, since the projections by h*, restrict the covariant deriva- 
tives to directions tangent to6(/). Locally the field fi can be expressed 
in the form fi = adf where f and a are functions on -W and f = 0 on 
O(S). Therefore y,, must be symmetric, since f,., = Segond f.gh®, = 0. 

The induced metric h = 6*g on ¥ defines a connection on £. We 
shall denote covariant differentiation with respect to this connection 
by a double stroke, ||. For any tensor T ¢ H’, 


het Bf he b 
Di ig preg oie ees er 


where T is any extension of T to a neighbourhood of 6(f). This 
definition is independent of the extension, as the hs restrict the 
covariant differentiation to directions tangential to 6(S). To see this 


2.7] HYPERSURFACES 47 


is the correct formula, one has only to show that the covariant deriva- 
tive of the induced metric is zero and that the torsion vanishes. This 
follows because 


hanic Fa (Jer + NN). gh°ghI,h9, = 0, 
and Stab = LPI CAM Pe = We Jace = Jena: 


The curvature tensor R’¢,., of the induced metric h can be related 
to the curvature tensor R¢,., on 0(Y) and the second fundamental 
form x as follows. If ¥Y €H is a vector field on 0(Y), then 


U | eee 
R Tied ye= Ve Veg 
Now 


V0 = (Yate = (¥%,pM_AS,), -haghigh*, 
= Ye, ht hl ght, + Ye, ;7,0,,hogh*,h*, F Ye, WIN; ho high*, 
and Ye, Aht 4 = (YN,). 4 g— VN,, hg = — YN,, hb g, 
since Y%, = 0 on (A), therefore 
Tyg Rh Mca t NoaN et Neck a) 
Since this holds for all YeH, 
Re ca = Regn h? hh Wg t x oXna FX aXe (2.34) 
This is known as Gauss’ equation. 
Contracting this equatidn on a and ¢ and multiplying by h?4, one 
obtains the curvature scalar A’ of the induced metric: 
nm = RFR n? + (ye,)* + ON (2.35) 


One can derive another relation between the second fundamental 
form and the curvature tensor R%,,, on 6(Y) by subtracting the 


pepregaons (X*a)iw Se (n*, gh4,). eh, 
and aaa = (n°, gh?,h4,), sth, 
finding X o1a— Xa = Ry nth*,. (2.36) 


This is known as Codacci’s equation. 


2.8 The volume element and Gauss’ theorem 


If {E*} is a basis of one-forms, one can form from it the n-form 


e= niE!, E7A... AE”. 
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If {E*}, related to {E%} by E” = ©, E2, is another basis, the n-form 
e’ defined by this basis will be related to € by 


e’ = det (®,)e, 


so this form is not unique. However, one can use the existence of the 
metric to define (in a given basis) the form 


n= |gl?e 
where g = det (g,,). This form has components 
Nap...a = 1! |g|? b%,q0% ... Oa. 


The transformation law for g will just cancel the determinant, 
det (®%,), provided that det (®2’,) > 0. Therefore if -# is orientable 
the n-forms yn defined by coordinate bases of an oriented atlas will be 
identical, i.e. given an orientation of .@, one can define a unique 
n-form field n, the canonical n-form, on M. 

The contravariant antisymmetric tensor 


IF = gregh i ae 
has components 


gd = (—)Kn-n! |g|t Sl, 6%, ... 62 


nN) 


where s is the signature of g (so 4(n—s) is the number of negative 
eigenvalues of the matrix of metric components (g,,)). Therefore these 
tensors satisfy the relations 


dy 5, = (—)RO-9n! 84,82, ... 0%). (2.37) 


The Christoffel relations imply that the covariant derivatives of 
Nad... and 4% with respect to the connection defined by the metric 


vanish, i.e. 
ab...0 =e 
q RCA ae 0 = Nab...dse 


Using the canonical n-form, one can define the volume (with respect 
to the metric §) of an n-dimensional submanifold Y of .# as i Y. 
u 


Thus y can be regarded as a positive definite volume measure on .#. 
We shall often use it in this sense, and shall denote it by dv. Note that 
d is not meant to represent the exterior differential operator here; dy 
is simply a measure on -#. If f is a function on -#, one can define its 
integral over Y with respect to this volume measure as 


jie [om 
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With respect to local oriented coordinates {x}, this can be expressed 
as the multiple integral 


| Ff \g|# da! da? ... dx, 
a 


which is invariant under a change of coordinates. 
If X is a vector field on -M, its contraction with y will be an (n— 1)- 
form field X .y, where 


(X.)y.a = X Mav... a: 


This (n—1)-form may be integrated over any (n—1)-dimensional 
compact orientable submanifold ¥. We write this integral as 


| xeac, = | con 
a a 


where the canonical form y is regarded as defining a measure-valued 
form do, on the submanifold W. If the orientation of VY is given by 
the direction of the normal form n,, then do, can be expressed as 
n,do where do is a positive definite volume measure on the sub- 
manifold ¥. The volume measure do is not unique unless the normal 
n, is normalized. If n, is normalized to unit magnitude in a metric § 
on @,i.e.n,n,9° = +1, then do is equal to the volume measure on 1 
defined by the induced metric on Y (to see this, simply choose an 
orthonormal basis with Mage” as one of the basis vectors). 

Using the canonical form, one can derive Gauss’ formula from 
Stokes’ theorem: for any compact n-dimensional submanifold Y of 4, 
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on using relation (2.37) twice. Therefore 
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holds for any vector field X; this is Gauss’ theorem. Note that the 
orientation on Y for which this theorem is valid is that given by the 
normal form y such that <n, X) is positive if X is a vector which points 
out of Y. If the metric ¢ is such that gn, n, is negative, the vector 
g@n, will point into %. 


2.9 Fibre bundles 


Some of the geometrical properties of a manifold @ can be most 
easily examined by constructing a manifold called a fibre bundle, 
which is locally a direct product of .@ and a suitable space. In this 
section we shall give the definition of a fibre bundle and shall consider 
four examples that will be used later: the tangent bundle T(-@), the 
tensor bundle 77(.@), the bundle of linear frames or bases L(.@), and 
the bundle of orthonormal frames O(.4). 

A C* bundle over a C* (s > k) manifold 4 is a C* manifold & and 
a C* surjective map 7: €& ->.M. The manifold @ is called the total space, 
M is called the base space and 7 the projection. Where no confusion 
can arise, we will denote the bundle simply by &. In general, the 
inverse image 7—1(p) of a point pe.W need not be homeomorphic to 
m—1(q) for another point gé-#. The simplest example of a bundle is 
a product bundle (4 x Sf, M,7) where & is some manifold and the 
projection 7 is defined by 7(p, v) = p for all pe.W, ve W. For example, 
if one chooses -# as the circle S! and .o~/ as the real line R!, one con- 
structs the cylinder C? as a product bundle over S!. 

A bundle which is locally a product bundle is called a fibre bundle. 
Thus a bundle is a fibre bundle with fibre F if there exists a neighbour- 
hood Y of each point g of @ such that 7—1(%) is isomorphic with Y x F, 
in the sense that for each point pe W there is a diffeomorphism Py of 
m\(p) onto F such that the map y defined by y(u) = (7(u), },.,)) is 
a diffeomorphism yr: 7-(W)>+&x F. Since .M is paracompact, we 
can choose a locally finite covering of W by such open sets Y,. If 
U, and U, are two members of such a covering, the map 


(Pa, ») ° (Dp, ed) 


is a diffeomorphism of F onto itself for each pe (YW, 1 YW y)- The inverse 
images 7 \(p) of points pe.W are therefore necessarily all diffeo- 
morphic to ¥ (and so to each other). For example, the Mébius strip 
is a fibre bundle over S! with fibre R!; we need two open sets %,, qe 
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to give a covering by sets of the form Y; x R!. This example shows that 
if a manifold is locally the direct product of two other manifolds, it is 
nevertheless not, in general, a product manifold; it is for this reason 
that the concept of a fibre bundle is so useful. 

The tangent bundle T() is the fibre bundle over a C* manifold .# 


obtained by giving the set & = U T,, its natural manifold structure 
DEM 


and its natural projection into .. Thus the projection 7 maps each 
point of 7,, into p. The manifold structure in & is defined by local 
coordinates {z4} in the following way. Let {x*} be local coordinates in 
an open set Y of -@. Then any vector VeT7, (for any peZ) can be 
expressed as V = J'é/éx'|,. The coordinates {24} are defined in 
m™\(U) by {z4} = {x?, V3}. On choosing a covering of W by coordinate 
neighbourhoods Y,, the corresponding charts define a C*-! atlas on & 
which turn it into a C*—! manifold (of dimension 72); to check this, one 
needs only note that in any overlap (Y%, N Y,) the coordinates {x*,} of 
a point are C* functions of the coordinates {z‘,} of the point, and the 
components {V’7,} of a vector field are C*—! functions of the compo- 
nents {V°,} of the vector field. Thus in 7-(Y, 1 W,), the coordinates 
{z4,} are C* functions of the coordinates {z4,}. 

The fibre 7~1(p) is T,, and so is a vector space of dimension n. This 
vector space structure is preserved by the map ¢, ,,: T,> Rk”, which 
is given by 9, ,(u) = Vu), i.e. 6,,, maps a vector at p into its com- 
ponents with respect to the coordinates {x*,}. If {x*,} are another set 
of local coordinates thef the map (¢,,,)0(¢, ,~') is a linear map of 
R” onto itself. Thus it is an element of the general linear group 
GL(n, R) (the group of all non-singular n x m matrices). 

The bundle of tensors of type (r,s) over @, denoted by T%(.4), is 


defined in a very similar way. One forms the set € = L) T%(p), defines 
pet 


the projection 7 as mapping each point in 7%(p) into 7, and, for any 
coordinate neighbourhood @ in -%, assigns local coordinates {z4} to 
nm (NH) by {24} = {xt, 7%, 4} where {x*} are the coordinates of the 
point p and {7,4} are the coordinate components of T (that is, 
T=», ,0/é2*@...@dzx4|,,). This turns & into a C*—! manifold of 
dimension n’+*+!; any point wu in 7%(.W) corresponds to a unique 
tensor T of type (7,8) at m(u). 

The bundle of linear frames (or bases) L(-@) is a C*—! fibre bundle 
defined as follows: the total space & consists of all bases at all points 
of .@, that is all sets of non-zero linearly independent n-tuples of 
vectors {E,}, E,€ 7, for each p€.@ (aruns from 1 ton). The projection 
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7 is the natural one which maps a basis at a point p to the point p. If 
{x'\ are local coordinates in an open set Yc M, then 


{24} = {a, HY, EF, ..., H,™} 
are local coordinates in 7—1(%), where E,/ is the jth components of the 
vector E, with respect to the coordinate bases é/éx*. The general 
linear group GL(n, R) acts on L(-#) in the following way: if {E,} is 
a basis at pe.W, then AcGL(n, R) maps u = {p, E,} to 
A(u) = {p, A qr Ey}. 

When there is a metric § of signature s on -@, one can define a sub- 
bundle of L(.M), the bundle of orthonormal frames O(M), which con- 
sists of orthonormal bases (with respect to g) at all points of 4. 
O(.M) is acted on by the subgroup O(}(n+5), $(n—s)) of GL(n, R). 
This consists of the non-singular real matrices A, such that 

A gy GincA ae = Goa 
where G,, is the matrix 
diag (4+- 15-7 1,....-- 1,1) 1. — 2). 
4(n+s) terms 4(n—s) terms 
It maps (p, E,)€O(4) to (p, A,, E,) € O(-@). In the case of a Lorentz 
metric (i.e. s = n— 2), the group O(n — 1, 1) is called the n-dimensional 
Lorentz group. 

A C* cross-section of a bundle is a C’ map 9: H> & such that 710 ® 
is the identity map on -#; thus a cross-section is a C’ assignment to 
each point p of -@ of an element ®(p) of the fibre 7—1(p). A cross- 
section of the tangent bundle 7'(.@) is a vector field on .#; a cross- 
section of 7%(.#) is a tensor field of type (7, s) on .#; a cross-section of 
L() is a set of m non-zero vector fields {E,} which are linearly inde- 
pendent at each point, and a cross-section of O(.@) is a set of ortho- 
normal vector fields on -#. 

Since the zero vectors and tensors define cross-sections in 7(.#) and 
T’(.M), these fibre bundles will always admit cross-sections. If .# is 
orientable and non-compact, or is compact with vanishing Euler 
number, there will exist nowhere zero vector fields, and hence cross- 
sections of T'(.#) which are nowhere zero. The bundles L(.#) and 
O(-M#) may or may not admit cross-sections; for example L(S?) does 
not, but L(R”) does. If L(.W) admits a cross-section, -# is said to be 
parallelizable. R.P.Geroch has shown (1968c) that a non-compact 
four-dimensional Lorentz manifold .@ admits a spinor structure if 
and only if it is parallelizable. 
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One can describe a connection on -@ in an elegant geometrical way 
in terms of the fibre bundle L(.#@ ). A connection on .# may be regarded 
as a rule for parallelly transporting vectors along any curve y(t) in 4%. 
Thus if {E,} is a basis at a point p = Y(to), i.e. {p, E,} is a point u in 
L(-#), one can obtain a unique basis at any other point y(t), Le. a 
unique point y(t) in the fibre 7-1(y(t)), by parallelly transporting {E,} 
along y(t). Therefore there is a unique curve Y(t) in L(.#), called the 
lift of y(t), such that: 

(1) V(to) = u, 

(2) m(¥(t)) = y(é), 

(3) the basis represented by the point 7(t) is parallelly transported 
along the curve y(t) in -@. 

In terms of the local coordinates {24}, the curve 7(t) is given by 
{x%(y(t)), Ep, (t)}, where 


dz, (t) 
dt 


xy detly(t)) — 
+#,JT = 0. 

Consider the tangent space T,(Z(-@)) to the fibre bundle L(@) at 
the point wu. This has a coordinate basis {2/éz4|,,}. The n-dimensional 
subspace spanned by the tangent vectors {(4/ét)=,)|,,} to the lifts of all 
curves y(f) through p is called the horizontal subspace H,, of T,(L(.@)). 
In terms of local coordinates, 


& _ dr(y(t)) @ dE _t @ 


at}; 7 dt dx** dt dE, 


dx(y(t)) ( é aks 
- dt (ea Bal arzprs) 

so a coordinate basis of H,, is {e/éx*— E,,/ 1*,, 6/é#,,"}. Thus the con- 
nection in -# determines the horizontal subspaces in the tangent spaces 
at each point of L(.#). Conversely, a connection in -# may be defined 
by giving an n-dimensional subspace of T,(L(-#)) for each we L(#) 
with the properties: 

(1) If AcGL(n, BR), then the map Ay: T,(L(4))>7T4,(L(M)) 
maps the horizontal subspace H,, into H,.,); 

(2) H,, contains no non-zero vector belonging to the vertical sub- 
space V,. 

Here, the vertical subspace V, is defined as the n?-dimensional 
subspace of 7),(L(.@)) spanned by the vectors tangent to curves in the 
fibre 7—1(7(w)); in terms of local coordinates, V, is spanned by the 


3 HLS 
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vectors {0/2E,,*'. Property (2) implies that 7, is the direct sum of H,, 
and V,. 

The projection map 7: L(.@)—>- induces a surjective linear map 
1: T,(L(M)) > Ta(4), such that 7,,(V,) = 0 and 7, restricted to H,, 
is 1-1 onto 7\,,). Thus the inverse 7,,~1 is a linear map of T,,y(@) 
onto H,,. Therefore for any vector X € T,,(.@) and point ue7(p), there 
is a unique vector K EH, called the horizontal lift of X, such that 
m(X) = X. Given a curve y(t) in.@, and an initial point win 7 (y(¢,)), 
one can construct a unique curve y(t) in L(.#), where y(t) is the curve 
through wu whose tangent vector is the horizontal lift of the tangent 
vector of y(t) in .@. Thus knowing the horizontal subspaces at each 
point in L(-#), one can define parallel propagation of bases along any 
curve y(t) in @. One can then define the covariant derivative along 
y(t) of any tensor field T by taking the ordinary derivatives with 
respect to ¢, of the components of T with respect to a parallelly 
propagated basis. 

If there is a metric § on 4 whose covariant derivative is zero, then 
orthonormal frames are parallelly prepagated into orthonormal 
frames. Thus the horizontal subspaces are tangent to O(.#) in L(.#@), 
and define a connection in O(-@),. 

Similarly a connection on .@ defines n-dimensional horizontal sub- 
spaces in the tangent spaces to the bundles T(.#) and T%(.@), by 
parallel propagation of vectors and tensors. These horizontal sub- 
spaces have coordinate bases 


fame ee (all upper indices) 


—T%.b, TF,,.—(all lower indices) | ra —| 


respectively. As with L(.@), 7, maps these horizontal subspaces 
one-one onto 7" hu) ); thus again 77, can be inverted to give a unique 
horizontal lift K € 7), of any vector Xe7",,). In the particular case of 
T(), u itself corresponds to a unique vector WeT,,(4), and so 
there is an intrinsic horizontal vector field W defined on 7 (#) by the 
connection. In terms of local coordinates {x7, V*}, 


= Tabs eee ire 
v(— Velo.) 
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This vector field may be interpreted as follows: the integral curve of W 
through wu = (p, X)e€T(.#) is the horizontal lift of the geodesic in # 
with tangent vector X at p. Thus the vector field W represents all 
geodesics on .-@. In particular, the family of all geodesics through 
ped is the family of integral curves of W through the fibre 
a (p) < T(); the curves in -@ have self intersections at least at p, 
but the curves in 7(.#) are non-intersecting everywhere. 


3-2 


3 
General Relativity 


In order to discuss the occurrence of singularities and the possible 
breakdown of General Relativity, it is important to have a precise 
statement of the theory and to indicate to what extent it is unique. 
We shall therefore present the theory as a number of postulates about 
a mathematical model for space-time. 

In §3.1 we introduce the mathematical model and in § 3.2 the first 
two postulates, local causality and local energy conservation. These 
postulates are common to both Special and General Relativity, and 
thus may be regarded as tested by the many experiments that have 
been performed to check the former. In §3.3 we derive the equations 
of the matter fields and obtain the energy-momentum tensor from a 
Lagrangian. 

The third postulate, the field equations, is given in §3.4. This is not 
so well established experimentally as the first two postulates, but we 
shall see that any alternative equations would seem to have one or 
more undesirable properties, or else require the existence of extra 
fields which have not yet been detected experimentally. 


3.1 The space-time manifold 


The mathematical model we shall use for space-time, i.e. the collection 
of all events, is a pair (.4,¢) where .@ is a connected four-dimensional 
Hausdorff Ce manifold and g is a Lorentz metric (i.e. a metric of 
signature +2) on &. 

Two models (.@,8) and (.#’,$') will be taken to be equivalent if 
they are isometric, that is if there is a diffeomorphism 6:.#@—>.#' 
which carries the metric § into the metric $’, i.e. 0,8 = g'. Strictly 
speaking then, the model for space-time is not just one pair (@, §) 
but a whole equivalence class of all pairs (-#’, &’) which are equivalent 
to (4,8). We shall normally work with just one representative mem- 
ber (.4, 8) of the equivalence class, but the fact that this pair is defined 
only up to equivalence is important in some situations, in particular 
in the discussion of the Cauchy problem in chapter 7. 

[ 56 ] 
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The manifold .# is taken to be connected since we would have no 
knowledge of any disconnected component. It istaken to be Hausdorff 
since this seems to accord with normal experience. However in 
chapter 5 we shall consider an example in which one might dispense 
with this condition. Together with the existence of a Lorentz metric, 
the Hausdorff condition implies that .W is paracompact (Geroch 
(1968c)). 

A manifold corresponds naturally to our intuitive ideas of the con- 
tinuity of space and time. So far this continuity has been established 
for distances down to about 10-'cm by experiments on pion scat- 
tering (Foley et al. (1967)). It may be difficult to extend this to much 
smaller lengths as to do so would require a particle of such high energy 
that several other particles might be created and confuse the experi- 
ment. Thus it may be that a manifold model for space-time is inap- 
propriate for distances less than 10-!%cm and that we should use 
theories in which space-time has some other structure on this scale. 
However such breakdowns of the manifold picture would not be 
expected to affect General Relativity until the typical gravitational 
length scale became of that order. This would happen when the density 
became about 10°° gm cm, which is a condition so extreme as to be 
completely beyond our present knowledge. Nevertheless, by adopting 
a manifold model for space-time, and making certain other reasonable 
assumptions, we shall show in chapters 8-10 that some breakdowns 
of General Relativity myst Occur. It may be the field equations that 
go wrong, or it may be that quantization of the metric is needed, or it 
may be a breakdown of the manifold structure itself that occurs. 

The metric ¢ enables the non-zero vectors at a point pe. to be 
divided into three classes: a non-zero vector Xe¢7J, being said to be 
timelike, spacelike or null according to whether g(X, X) is negative, 
positive or zero respectively (cf. figure 5). 

The order of differentiability, r, of the metric ought to be sufficient 
for the field equations to be defined. They can be defined in a distribu- 
tional sense if the metric coordinate components g,, and g® are con- 
tinuous and have locally square integrable generalized first derivatives 
with respect to the local coordinates. (A set of functions f,, on R” are 
said to be the generalized derivatives of a function f on R” if, for any 
C@ function y on R” with compact support, 


[fievrare = — | feyp[o0a) are.) 
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However this condition is too weak, since it guarantees neither the 
existence nor the uniqueness of geodesics, for which a C?- metric is 
required. (A O?- metric is one for which the first coordinate derivatives 
of the metric coordinate components satisfy a local Lipschitz condi- 
tion, see § 2.1.) We shall in fact assume for most of the book that the 
metric is at least C?. This allows the field equations (which involve the 
second derivatives of the metric) to be defined at every point. In § 8.4 
we shall weaken the condition on the metric to C?- and show that this 
does not affect the results on the occurrence of singularities. 

In chapter 7, we use a different kind of differentiability condition 
in order to show that the time development of the field equations is 
determined by suitable initial conditions. We require there that the 
metric components and their generalized first derivatives up to order 
m(m > 4) are locally square integrable. This would certainly be true if 
the metric were C*. 

In fact, the order of differentiability of the metric is probably not 
physically significant. Since one can never measure the metric exactly, 
but only with some margin of error, one could never determine that 
there was an actual discontinuity in its derivatives of any order. Thus 
one can always represent one’s measurements by a C® metric. 

If the metric is assumed to be C’, the atlas of the manifold must be 
Cr+1, However, one can always find an analytic subatlas in any C* atlas 
(s > 1) (Whitney (1936), cf. Munkres (1954)). Thus it is no restriction 
to assume from the start that the atlas is analytic, even though one 
could physically determine only a C’+! atlas if the metric were O". 

We have to impose some condition on our model (.4,¢) to ensure 
that it includes all the non-singular points of space-time. We shall say 
that the C” pair (.#’,8') is a C’-extension of (.@,§) if there is an iso- 
metric C’ imbedding “: W—>-M’. If there were such an extension 
(.@’, 8’) we should have to regard points of .@ as also being points of 
space-time. We therefore require that the model (.4,§) is Cr- 
inextendible, that is there is no C” extension (.W’, 8’) of (4, §) where 
f(M) does not equal .#’. 

As an example of a pair (M,, $,) which is not inextendible, consider 
two-dimensional Euclidean space with the x-axis removed between 
az, = —1 and x, = +1. The obvious way to extend this would simply 
be to replace the missing points, but one could also extend it by taking 
another copy (.@2, 8.) of the space, and identifying the bottom side 
of the x-axis for |x,| < 1 with the top side of the «,-axis for eo ee IL 
and also identifying the top side of the «,-axis for |z,| < 1 with the 
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bottom side of the x,-axis for |x,| < 1. The resultant space (M3, £,) is 
inextendible but not complete as we have left out the points x, = + 1, 
y, = 0. We cannot put these points back in because we were perverse 
enough to extend the top and bottom sides of the x-axis on different 
sheets. If however one takes the subset Y of , defined by 1 < x, < 2, 
—1<y, < 1, then one could extend the pair (Y,,|,) and put back 
the point x, = 1, y, = 0. This motivates a rather stronger definition of 
inextendibility: a pair (.@,8) is said to be C’-locally inextendible if 
there is no open set Y < .W with non-compact closure in 4, such that 
the pair (W, g|,,) has an extension (%’, §’) in which the closure of the 
image of Y is compact. 


3.2. The matter fields 


There will be various fields on 4, such as the electromagnetic field, the 
neutrino field, etc., which describe the matter content of space-time. 
These fields will obey equations which can be expressed as relations 
between tensors on -@ in which all derivatives with respect to position 
are covariant derivatives with respect to the symmetric connection 
defined by the metric g. This is so because the only relations defined 
by a manifold structure are tensor relations, and the only connec- 
tion defined so far is that given by the metric. If there were another 
connection on .%, the difference between the two connections would 
be a tensor and could be regarded as another physical field. Similarly 
another metric on -# could be regarded as a further physical field. 
(The equations of the matter fields are sometimes expressed as 
relations between spinors on #. We do not deal with such relations 
in this book, as they are not needed for the problems we wish to 
consider. In fact, all spinor equations can be replaced by rather more 
complicated tensor equations; see e.g. Ruse (1937).) 

The theory one obtains depends on what matter fields one incorpo- 
rates in it. One should of course include all such fields which have been 
experimentally observed, but one might postulate the existence of as 
yet undetected fields. Thus for example Brans and Dicke (Dicke 
(1964), appendix 7) postulate the existence of a long range scalar field 
which is weakly coupled to the trace of the energy-momentum tensor. 
In the form given in Dicke (1964) appendix 2, the Brans—Dicke theory 
can be regarded simply as General Relativity with an extra scalar 
field. Whether this scalar field has been experimentally detected or 
not is at present under dispute. 
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We shall denote the matter fields included in the theory by 
Wye, gy where the subscript (i) numbers the fields considered. The 
following two postulates on the nature of the equations obeyed by the 
Ww% 8. g¢ are common to both the Special and the General Theories 
of Relativity. 


Postulate (a): Local causality 


The equations governing the matter fields must be such that if Y is 
a convex normal neighbourhood and p and q are points in WY then a 
signal can be sent in YW between p and q if and only if p and q can be 
joined by a C! curve lying entirely in Y, whose tangent vector is every- 
where non-zero and is either timelike or null; we shall call such a curve, 
non-spacelike. (Our formulation of relativity excludes the possibility 
of particles such as tachyons, which move on spacelike curves.) 
Whether the signal is sent from p to q or from q to p will depend on the 
direction of time in Y. The problem of whether a consistent direction 
of time can be assigned at all points of space-time will be considered 
in § 6.2. . 

A more precise statement of this postulate can be given in terms of 
the Cauchy problem of the matter fields. Let p ¢ WY be such that every 
non-spacelike curve through p intersects the spacelike surface x4 = 0 
within Y. Let F be the set of points in the surface x4 = 0 which can be 
reached by non-spacelike curves in Y from p. Then we require that the 
values of the matter fields at p must be uniquely determined by the 
values of the fields and their derivatives up to some finite order on F, 
and that they are not uniquely determined by the values on any 
proper subset of ¥ to which it can be continuously retracted. (For 
a fuller discussion of the Cauchy problem, see chapter 7.) 

It is this postulate which sets the metric § apart from the other 
fields on .@ and gives it its distinctive geometrical character. If {x7} are 
normal coordinates in Y about 9, it is intuitively fairly obvious (and 
is proved in chapter 4) that the points which can be reached from p by 
non-spacelike curves in YW are those whose coordinates satisfy 


(at)? + (x)? + (a3)? — (x4)? < 0. 


The boundary of these points is formed by the image of the null cone 
of p under the exponential map, that is the set of all null geodesics 
through p. Thus by observing which points can communicate with Dp, 
one can determine the null cone W, in T,,. Once N, is known, the metric 
at p may be determined up to a conformal factor. This may be seen as 
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follows: let X,Y e€7,, be respectively timelike and spacelike vectors. 
The equation 


g(X+AY, X+AY) = g(X, X) + 2Ag(X, Y) + A2g(Y, Y) 
= 


will have two real roots A, and A, as g(X, X) < 0 and g(Y, Y) > 0. If 
N, is known, A, and A, may be determined. But 


Ay Az = 9(X, X)/9(Y, Y). 


Thus the ratio of the magnitudes of a timelike vector and a spacelike 
vector may be found from the null cone. Then if W and Z are any two 
non-null vectors at p, 


g(W, Z) = 3(g(W, W) + 9(Z, Z)-g(W+Z,W+Z)). 


Each of the magnitudes on the right-hand side may be compared with 
the magnitude of either X or Y, and so g(W, Z)/g(X, X) may be found. 
(If W+Z is null, the corresponding expression involving W+ 2Z 
could be used.) Thus observation of local causality enables one to 
measure the metric up to a conformal factor. In practice this measure- 
ment is performed most conveniently using the experimental fact that 
no signal has been observed to travel faster than electromagnetic 
radiation. This means that light must travel on null geodesics. This 
however is a consequence of the particular equations the electro- 
magnetic field obeys, not of the theory of relativity itself. Causality 
will be considered further in chapter 6. Among other results, it will be 
shown that causal relations may be used to determine the topological 
structure of .#. The conformal factor in the metric may be determined 
using postulate (b) below; thus all the elements of the theory will be 
physically observable. 


Postulate (b): Local conservation of energy and momentum 


The equations governing the matter fields are such that there exists 
a symmetric tensor 7°, called the energy-momentum tensor, which 
depends on the fields, their covariant derivatives, and the metric, and 
which has the properties: 

(i) 7’ vanishes on an open set % if and only if all the matter fields 
vanish on %, 

(ii) T° obeys the equation 


[> , = 0, (3.1) 
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Condition (i) expresses the principle that all fields have energy. One 
might possibly object to the ‘only if’ on the grounds that there might 
be two non-zero fields, one of whose energy-momentum tensor exactly 
cancelled that of the other. This possibility is related to that of the 
existence of negative energy which will be discussed in § 3.3. 

If the metric admits a Killing vector field K, equations (3.1) can be 
integrated to give a conservation law. To see this, define P* to be the 
vector whose components are P? = T%K,. Then, 


Po, = 7%, K,+ TK, 9. 


The first term is zero by the conservation equations, and the second 
vanishes as 7’ is symmetric and 2K(q.,) = Lega, = 9, since K is a 
Killing vector. Thus if 9 is a compact orientable region with boundary 
09, Gauss’ theorem (§2.7) shows 


i Pdo, = | P?.,dv = 0. (3.2) 
6g g 


This may be interpreted as saying that the total flux over a closed 
surface of the K-component of energy-momentum is zero. 

When the metric is flat, as it is in the Special Theory of Relativity, 
one may choose coordinates {x*} in which the components of the metric 
a©e Jan = €q 5p (NO SUMMation) where d,, is the Kronecker delta and 
e, is —1 if a= 4 and is +1 if a= 1,2,3. Then the following are 
Killing vectors: L = d/éx" (a =1,2,3, 4) 


(these generate four translations) and 


a é : 
aH = 622% Oat — €gxh rae (no summation; «, § = 1, 2, 3, 4) 
(these generate six ‘rotations’ in space-time). These isometries form 
the ten-parameter Lie group of isometries of flat space-time known as 
the inhomogeneous Lorentz group. One may use them to define ten 


vectors = and : which will obey (3.2). We may think of A as repre- 
senting a -_ of energy and P, By “ as the flow of the ines compo- 


nents of linear momentum. The P can be interpreted as the flow of 
ap 
angular momentum. 


If the metric is not flat there will not, in general, be any Killing 
vectors and so the above integral conservation laws will not hold. How- 
ever, in a suitable neighbourhood of a point g one may introduce 
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normal coordinates {x7}. Then at q the components g,, of the metric are 
€,9, (no summation), and the components [¢,, of the connection are 
zero. One may take a neighbourhood @ of q in which theg,, and T,, 
differ frorn their values at g by an arbitrarily small amount; then the 
Lig, 1) and Ma: ») Will not exactly vanish in Z, but will in this neigh- 
fe 4 a 


bourhood differ from zero by an arbitrarily small amount. Thus 


P'do, and P’do, 
aGa a9 af 

will still be zero in the first approximation; that is to say, one still has 
approximate conservation of energy, momentum and angular 
mormenturn in a small region of space-time. Using this it can be shown 
that a small isolated body moves approximately on a timelike geodesic 
curve independent of its internal constitution provided that the energy 
density of matter in it is non-negative (for an account of the motion of 
a srnall body in relativity, see Dixon (1970)). This may be thought of 
as Galileo's principle that all bodies fall equally fast. In Newtonian 
terms one would say that the inertial mass (the m in F = ma) and the 
passive gravitational mass (the mass acted on by a gravitational field) 
are equal for all bodies. This has been verified to a high order of 
accuracy in experiments by Eotvos and by Dicke (1964). 

Postulate (a) enables one to measure the metric up to a conformal 
factor at each point. Using postulate (L) one may relate these factors 
at different points, for the conservation equations T%., = 0 would not 
in general hold for a connection derived from a metric 8 = °g. One 
way of doing this would be to observe the paths of small ‘test’ particles 
and so to determine the timelike geodesic curves. Then if y(t) is such a 
curve with tangent vector K = (¢/ét),, one has from (2.29) 

D D =! bKa a C ad{) 
gta G+ 20 0, KeK*—O-Y K°K 9,2) g4Q. 4. 
Since y(t) is a geodesic with respect to the space-time metric §, 


K®(D/ét) K9 = 0. Thus 


Ko? Ka = —(K°K%G,,) °g™ (log 0)... (3.3) 
é 

Knowing the conformal structure, one can choose a metric & which 
represents the conformal equivalence class of metrics and can evaluate 
the left-hand side of (3.3) for any test particle. Then the right-hand side 
of (3.3) determines (log Q).,, up to the addition of a multiple of A°g,,. 
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By considering another curve y’(t) whose tangent vector K’@ is not 
parallel to K%, one can find (log Q),, and so can determine Q every- 
where up to a constant multiplying factor. This constant factor 
specifies one’s units of measurement, and so can be chosen arbitrarily. 

This is, of course, not the way one measures the conformal factor in 
practice; one makes use of the fact that there exist a large number of 
similar systems (such as the electronic states of atoms) whose internal 
motions define a number of events along the timelike curve which 
represents their position in space-time. The intervals between these 
events seem to be independent of their past history in the sense that 
the intervals measured by two nearby systems correspond. If one can 
effectively isolate them against external matter fields (so they must 
move on geodesic curves) and if one assumes their internal motion is 
independent of the curvature of space-time, then the only thing it can 
depend on is the metric. Thus the arc-length between two successive 
events on a curve must be the same for each pair of successive events 
on any such curve. If one takes this arc-length as one’s unit of measure- 
ment, one can determine the conformal factor at any point of space— 
time. 

In fact it may not be possible to isolate a system from external 
matter fields. Thus for example in the Brans—Dicke theory there is 
a scalar field which is non-zero everywhere. However the conformal 
factor can still be determined by the requirement that the conserva- 
tion equation 7'%°,, = 0 should hold. Thus knowledge of the energy— 
momentum tensor T,, determines the conformal factor. 


3.3 Lagrangian formulation 


The conditions (i) and (ii) of postulate (b) do not tell one how to con- 
struct the energy-momentum tensor for a given set of fields, or whether 
it is unique. In practice one relies heavily on one’s intuitive knowledge 
of what energy and momentum are. However, there is a definite and 
unique formula for the energy-momentum tensor in the case that the 
equations of the fields can be derived from a Lagrangian. 

Let L be the Lagrangian which is some scalar function of the fields 
Pw" ?...a their first covariant derivatives, and the metric. One 
obtains the equations of the fields by requiring that the action 


r={ Ldv 
2 
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be stationary under variations of the fields in the interior of a compact 
four-dimensional region Y. By a variation of the fields Vint, ,inD 
we mean a one-parameter family of fields ¥.,(w,7) where we ( —€,€) 
and re.@, such that 

(i) Po(0,r) = P(r), 

(i) Wi(u,r) = By(r) when rew—-GQ. 


We denote OF y(u,r)/eul,9 by AW. 
Then 


ol 
ou 


ol 
ca —_——____ Af, .4...0 
wee = (a (i) c...d 
ob 


or he me) dv, 


where ‘Y)*-?. g,- are the components of the covariant derivatives 
of By. But ACP? ase) = (AV wt?o...a):¢. thus the second term 
can be expressed as 


OL 
Ellice 
Cecio, 9 eer) 


oL 
= | A ee, | dv 
——— 42 a 


The first term in this expression can be written as 


i Q*, , dv -{ OF da. 
9 aD 


where Q is a vector whose components are 


oL 
= Sy oo AF yt? a 
QUtG ae CO 


CF 
This integral is zero as condition (ii) is the statement that AY, vanish 
at the boundary ¢Y. Thus in order that éI/éu|,_) should vanish for 
all variations on all volumes J, it is necessary and sufficient that the 


Euler—Lagrange equations, 


oL ( oL 
ee ee | | ee |) (3.4) 
Gt erg Cine s ae se 


hold for all 7. These are the equations of the fields. 
We obtain the energy-momentum tensor from the Lagrangian by 
considering the change in the action induced by a change in the metric. 
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Suppose a variation g,,(u,7) leaves the fields Y)%®, zg unchanged 
but alters the components g,, of the metric. Then 


ol ol A oL 
pass = ee AE ee. . es - dy 
ou u=0 oa ( ” ave) Jap wi 
i | — Aga. (3.5) 
B Jab 


The last term arises because the volume measure dv depends on the 
metric, and so will vary when the metric is varied. To evaluate this 
term, recall that dv is in fact the canonical four-form 7 whose compo- 
nents are Yapeg = (—9)24! 0[,10,26,3 644, where g = det (g,,). Therefore 


Marea 1 4 og 
Maved _ _ 4 g)-4 9 419, 16.2535,8 
Ber 3(—g) Gg ote Ob Fe: Oa 


= —$(—g) t9% 94! 6,1 6,7 6,3 da! 
= 397 Napea: 


Thus alee) = $9 dv. 
Jar 

The first term in (3.5) arises because A(Py)28, d:e) Will not neces- 
sarily be zero even though AY¥()*--5, 4 is, since the variation in the 
metric will induce a variation in the components ¢,,, of the connection, 
As the difference between two connections transforms like a tensor, 
Af¢,, may be regarded as the components of a tensor. They are related 
to the variation in the components of the metric by 

AT, = 29° {(Agap). oe (Agae). ae (Agy.)- a}: 

(The easiest way to derive this formula is to note that since it isa tensor 
relation, it must be valid in any coordinate system. In particular, one 
could choose normal coordinates about a point p. For these coordinates 
the components I¢,, and the coordinate derivatives of the components 
Jap Vanish at p. The formula given can then be verified to hold at a) 
Using this relation, AW()¢->, _ d;e May be expressed in terms of 
(Ag,-), ¢ and the usual integration by parts employed to give an inte- 
grand involving Ag,, only. Thus we may write 0I/du as 


| (TAg,,) do, 
27] 


where 7” are the components of a symmetric tensor which is taken 
to be the energy-momentum tensor of the fields. (See Rosenfeld (1940) 
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for the relation between this tensor and the so-called canonical energy— 
momentum tensor.) 

This energy-momentum tensor satisfies the conservation equations 
as a consequence of the field equations obeyed by the ¥,,2--8, 4. For 
suppose one has a diffeomorphism ¢:.W—+-W which is the identity 
everywhere except in the interior of Z. Then, by the invariance of 
integrals under a differential map, 


ve = [Lav =[ m = egies = [9m 
Thus i (Ly — 6*(Lm)) = 0. 


If the diffeomorphism ¢ is generated by a vector field X (non-zero only 
in the interior of Z) it follows that 


[ ,tx(n) = 0. 


But 
EVE oL 
Lx(Ly) = qa (asa 
[, x ”) @ = Cea eave fe 


x Dg Piyt?, adv +{. T® Ly Jap Av. 


The first term vanishes as a consequence of the field equations. In the 
second term, Lyg,, = 2X (a3): Thus 


[ ,(Ceexaas) dv = 2) ((20X,),.— 2", »X,)ae. 


The first contribution may be transformed into an integral over the 
boundary of 9 which vanishes as X is zero there. Since the second 
term must therefore be zero for arbitrary X, it follows that 7, , = 0. 

We shall now give as examples Lagrangians for some fields which 
will be of interest later. 


Example 1: A scalar field y 


This can represent, for example, the 7°-meson. The Lagrangian is 


m2 
b= -bati.9— Be 


where m, % are constants. The Euler-Lagrange equations (3.4) are 
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The energy-momentum tensor is 


2 
i ae V5aV;0—29an (Victiao+ Fv). (3.6) 


Example 2: The electromagnetic field 


This is described by a one-form A, called the potential, which is defined 
up to the addition of a gradient of a scalar function. The Lagrangian is 


1 
L=- By tao tcagg", 


where the electromagnetic field tensor F is defined as 2dA, ie. 
Fy = 2A, q. Varying A,, the Euler-Lagrange equations (3.4) are 


F, 


This and F,,..; = 9 (which is the equation dF = d(dA) = 0) are the 
Maxwell equations for the source-free electromagnetic field. The 
energy—momentum tensor is 


wer” = 0. 


1 ie 
— Tq Pac Ba S Adee tas 6) (3.7) 


Example 3: A charged scalar field 
This is really a combination of two real scalar fields 7, and y,. These 
are combined into a complex scalar field yy = ~%,+i,/,, which could 
represent, forexample, 7+ and 7— mesons. The total Lagrangian of the 
scalar field and electromagnetic field is 
— RS 

L=- (y, a ar aeel. p)gr(y, ijn ieA, y) a qe yy ~ ghar leag 9", 
where e is a constant and yf is the complex conjugate of yy. Varying 
y, w and A, independently, one obtains 


m? / . : 
yp, ab 7 of + ied, 9 (2am b as ied, y) need. pg — 0, 


and its complex conjugate, and 
1 = Se S a Bi AS _ 
Tyee of” — iey (vy. (1 iewdy, Vv) iey (yy. a ae nelle y) = 0. 


The energy-momentum tensor is 


V2 om MY aViot Vi ayo) tH—V, gied, +7, ied y 


- ? =, ” Il = 
ate Wea leat Vs ied gh) + Fichrag + PA, Ay + $1. gap. 
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Example 4: An isentropic perfect fluid 

The technique here is rather different. The fluid is described by a 
function p, called the density, and a congruence of timelike curves, 
called the flow lines. By a congruence of curves, is meant a family of 
curves, one through each point of . If @ isa sufficiently small com- 
pact region, one can represent a congruence by a diffeomorphism 
y: [a,b] x V+ where [a,b] is some closed interval of R! and / is 
some three-dimensional manifold with boundary. The curves are said 
to be timelike if their tangent vector W = (@/ét),, t¢[a, b], is timelike 
everywhere. The tangent vector V is defined by V = (—g(W, W))? W, 
so g(V, V) = —1, and the fluid current vector is defined by j = pV. It 
is required that this is conserved, ie. j*., = 0. The behaviour of the 
fluid is determined by prescribing the elastic potential (or internal 
energy) € as a function of p. The Lagrangian is taken to be 


L = —2p(1+e) 


and the action I is required to be stationary when the flow lines are 
varied and p is adjusted to keep 7? conserved. A variation of the flow 
lines is a differentiable map y: (— 46,6) x [a,b] x. /—> & such that 


y(0, [a, O14} = y([a, b),%) 
and = y(u, [a,b], VY) = y([a,6], VY) on M-G, (ue(—3,4)). 


Then it follows that AW = Lg W where the vector K is K = (6/0u),. 
This vector may be thought of as representing the displacement, amder 
the variation, of a point of the flow line. It follows that 


a eer eek 
Using the fact that A(j?.,) = 0 = (Aj*), q, one has 
(Ap). a ve4 Apr, Pg hV + piAV’). = 
Substituting for AV@ and integrating along the flow lines, one finds 
Ap = (pK*).,+pKp;-V°V’. 
Therefore the variation of the action integral is 
= i - 2] {(eK),0+ PK r:« a) (: +P) aw 
Integrating by parts, 
ol 


ae ae) re d(ép) a4 [eye we lles 
ie lg) Aten) vee”) I 
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where V4 = V¢., V°. If this is zero for all K, it follows that 
(2+ p) V2 = —p,p(g + VV"), 


where # = p(i+e) is the energy density and p = p%(de/dp) is the 
pressure. Thus V“, the acceleration of the flow lines, is given by the 
pressure gradient orthogonal to the flow lines. 

To obtain the energy-momentum tensor one varies the metric. The 
calculations may be simplified by noting that the conservation of the 
current may be expressed as 


en ee ae 
(9 a= aq an g)) ) 


Given the flow lines, the conservation equations determine j* uniquely 
at each point on a flow line in terms of its initial value at some given 
point on the same flow line. Therefore (./ — g) 77 is unchanged when the 
metric is varied. But 


0. 


x 


Pp? = 9 (Vv —9) 97 (WV —9)9") Jao 
so 2pAp = 2 Soin 
de de 
zp — > Naas ayo 2 ,ab 
and thus gh {e(l +e) +p =| V2Ve+p ae! 


= (u+p) V2V° + pg. (3.8) 


= 


We shall call any matter whose energy-momentum tensor is of the 
above form (whether or not it is derived from a Lagrangian) a perfect 
fluid. From the energy and momentum conservation equations (3.1) 
applied to (3.8) one finds 


Big V%+ (utp) Ve 0, (3.9) 

ep) Vg VV = 0. (3.10) 

These are the same as the equations derived from the Lagrangian. We 

shall call a perfect fluid isentropic if the pressure p is a function of the 

energy density uw only. In this case one can introduce a conserved 

density p and an internal energy ¢ and derive the equations and the 
energy-momentum tensor from a Lagrangian. 

One may also give the fluid a conserved electric charge e (i.e. 


J*,, = 0 where J = e V is the electric current). The Lagrangian for 
the ole fluid and the electromagnetic field is 


L= - Pp Fagg — 2p(1 ae) = 
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The last term gives the interaction between the fluid and the field. 
Then varying A, the flow lines and the metric respectively, one finds 


Dae b = 4nJ¢, 
(itp) V4 = — pg" + Vey") + Fs, J, 


: 1 
co (u +p) yeye + pg2 a rs (Fe. Foe 49°F 4 ) : 


3.4 The field equations 


So far, the metric g has not been specified. In the Special Theory of 
Relativity, which does not include gravitational effects, it is taken to 
be flat. One might think that one could include gravitation by keeping 
the metric flat and by introducing an extra field on space-time. How- 
ever, experiments have shown that light rays travelling near the sun 
are deflected. Since light rays are null geodesics, this shows that the 
space-time metric cannot be flat or even conformal to a flat metric. 
One therefore has to give some prescription for the curvature of 
space-time. It turns out that this prescription can be chosen so as to 
reproduce the results of Newtonian gravitation theory in the limit of 
small slowly varying curvature. It is therefore not necessary to intro- 
duce an extra field to describe gravitation. This is not to say that there 
could not be an additional field that produced part of the gravitational 
effects. Such a scalar field has been suggested by Jordan (1955), and 
Brans and Dicke (see Dické (1964)). However, as mentioned before, 
such an additional field could be regarded as simply another matter 
field and included in the total energy-momentum tensor. We therefore 
adopt the view that the gravitational field is represented by the 
space-time metric itself. The problem then becomes one of finding 
field equations to relate the metric to the distribution of matter. 

These equations should be tensor equations involving the matter 
only through its energy-momentum tensor, i.e. should not distinguish 
between two different matter fields which have the same distribution 
of energy and momentum. This can be regarded as a generalization of 
the Newtonian principle that the active gravitational mass of a body 
(the mass producing a gravitational field) is equal to the passive gravi- 
tational mass (the mass acted on by the gravitational field). This has 
been verified experimentally by Kreuzer (1968). 

To determine what the field equations should be, we shall consider 
the Newtonian limit. Since the Newtonian gravitational field equation 
does not involve time, the correspondence with Newtonian theory 


72 GENERAL RELATIVITY — [3.4 


should be made in a metric which is static. By a static metric is meant 
a metric which admits a timelike Killing vector field K which is 
orthogonal to a family of spacelike surfaces. These surfaces may be 
regarded as surfaces of constant time and may be labelled by the 
parameter t. We define the unit timelike vector V as f-!K, where 
f? =—K¢K,. Then V*,,=—V*h, where V@= V2.,V° = f-f.,9” 
represents the departure from geodesity of the integral curves of V 
(which are of course also integral curves of K). Note that VV, = 0. 

These integral curves define the static frame of reference, that is to 
say, the space-time metric seems to be independent of time to a 
particle whose history is one of these curves. A particle released from 
rest and following a geodesic would appear to have an initial accelera- 
tion of —V with respect to the static frame. If f differs only slightly 
from unity the initial acceleration of a freely moving particle released 
from rest is approximately minus the gradient of f. This suggests that 
one should regard f—1 as the quantity analogous to the Newtonian 
gravitational potential. 

One can derive an equation for this potential by considering the 
divergence of V*: 


oe = Uoe Ve\e oS Wren Vo [8B yee 
= Ry VAV?+ (Vo, 4); 4 V+ (Vy V9)? = Ray VAV?, 
. 

But he t= Caen gael: = Soae me Bo ee eis 
se fras V°V" = —f,qV%,,V° =—f-¥, of. 9, 
so one finds Fade + V2V2) = fhe vere 
The term on the left is the Laplacian of f with respect to the induced 
metric in the three-surface {tf = constant}. If the metric is almost flat, 
this will correspond to the Newtonian Laplacian of the potential. 
One would therefore obtain agreement with Newtonian theory in the 
limit of a weak field (i.e. when f ~ 1) if the term on the right is equal 
to 47G times the matter density plus terms which are small in the weak 


field limit. 
This will be the case if there is a relation of the form 


Ra an ee (3.11) 


where K,,, is a tensorial function of the energy-momentum tensor and 
the metric, which is such that (47G)-1K ,, V2V° is equal to the matter 
density plus terms which are small in the Newtonian limit. We shall 
for the moment assume a relation of this form. 
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Since R,, satisfies the contracted Bianchi identities Bo = Rigs 
(3.11) implies me ae (3.12) 
This shows that the apparently natural equation K,, = 4nGT,,, cannot 
be correct, since (3.12) and the conservation equations Lp =e 
would imply 7. , = 0. For a perfect fluid, for example, this would mean 
that 4 — 3p was constant throughout space-time, which is clearly not 
satisfied by a general fluid. 

In fact in general, the only first order identities satisfied by the 
energy-momentum tensor are the conservation equations. From this 
it follows that the only tensorial function K,,, of the energy-momentum 
tensor and the metric which obeys the identities (3.12) for all energy— 
momentum tensors, is 


Ky = Ky =e oon) op, (3.13) 


where « and .\ are constants. The values of these constants can be 
determined from the Newtonian limit. Consider a perfect fluid with 
energy density « and pressure p whose flow lines are the integral curves 
of the Killing vector (i.e. the fluid is at rest in the static frame). The 
energy-momentum tensor is given by (3.8). Putting this in (3.13) and 
(3.11), one finds 


F; anlg? + VV") = f(4dx(u + 8p) —A). (3.14) 


In the Newtonian limit the pressure p is normally very small compared 
to the energy density mu (We are using units in which the speed of 
light is unity. In units in which the speed of light is c, the expression 
jt+ 3p should be replaced by 4+ 3p/c?.) One would therefore obtain 
approximate agreement with Newtonian theory if « = 87G and if |A| 
is very small. We shall use units of mass in which G = 1. In these units, 
a mass of 107° gm corresponds to a length of 1cm. Sandage’s (1961, 
1968) observations of distant galaxies place limits on |A| of the order 
of 10-** em-?; we shall normally take A to be zero, but shall bear in 
mind the possibility of other values. 

One may then integrate (3.14) over a compact region F of the three- 
surface {tf = constant} and transform the left-hand side into an integral 
of the gradient of f over the bounding two-surface 0F : 


i f(dm(-+ 3p) do = i Feaslgt + VV?) do 
Fr F 


= i fralge? + V2V?) dry, 
OF 
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where da is the volume element of the three-surface {¢ = constant} in 
the induced metric, and dz, is the surface element of the two-surface 
oF in the three-surface. This gives the analogue of the Newtonian 
formula for the total mass contained within a two-surface. There are 
however two important differences from the Newtonian case: 

(i) a factor f appears in the integral on the right-hand side. This 
means that matter placed in a region where f is considerably less than 
one (a large negative Newtonian potential) makes a smaller contribu- 
tion to the total mass than does the same matter in a region where f is 
almost one (small negative Newtonian potential); 

(ii) the pressure contributes to the total mass. This means that in 
some circumstances it can actually assist rather than prevent gravita- 
tional collapse. 


The equations Rap = 82(L,4—47 Gan) + AGap 
are called the Hinstein equations and are often written in the equivalent 
form (Ray —4PGap) + AGan = 87T. (3.15) 


Since both sides are symmetric, these form a set of ten coupled non- 
linear partial differential equations in the metric and its first and 
second derivatives. However the covariant divergence of each side 
vanishes identically, that is, 


(Ro? — 3 Rg” + Ag”), = 0 . 
and ae: b => 


hold independent of the field equations. Thus the field equations really 
provide only six independent differential equations for the metric. 
This is in fact the correct number of equations to determine the space— 
time, since four of the ten components of the metric can be given 
arbitrary values by use of the four degrees of freedom to make co- 
ordinate transformations. Another way of looking at this is that two 
metrics §, and g, on a manifold .# define the same space-time if there 
is a diffeomorphism 6 which takes g, into g,. Therefore the field equa- 
tions should define the metric only up to an equivalence class under 
diffeomorphisms, and there are four degrees of freedom to make 
diffeomorphisms. 

We shall consider the Cauchy problem for the Einstein equations 
in chapter 7, and shall show that, together with the equations for the 
matter fields, they are sufficient to determine the evolution of space— 
time given suitable initial conditions, and that they satisfy the 
causality postulate (a). 
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The Einstein equations can be derived by requiring that the action 
I= | (A(R-2A)+L)a0 (3.16) 


be stationary under variations of g,,, where L is the matter Lagrangian 
and A a suitable constant. For 


A(R "a 2A) dv) = ((R 7a 2A) $9” Agar + Eyag”” ate g VY AE.») dv. 
The last term can be written 


gvAR,y dv = 92((AT 44), «— (AT ac); 5) dv 


= Ane ge aa Ae gq”), c dv. 
Thus it may be transformed into an integral over the boundary @, 
which vanishes as AT@,, vanishes on the boundary. Therefore 


ol 
ou 


-{ {A (($.R — A)g2 — Re) + 7} Ag, do, (3.17) 
u=0 7) 
and so if ¢f/@u vanishes for all Ag,,, one obtains the Einstein equations 
on setting A = (87)-1. 

One might ask whether varying an action derived from some other 
scalar combination of the metric and curvature tensors might not give 
a reasonable alternative set of equations. However the curvature scalar 
is the only such scalar linear in second derivatives of the metric tensor; 
so only in this case can one transform away a surface integral and be 
left with an equation involving only second derivatives of the metric. 
If one tried any other scalar such as R,,R® or R,y,.qR4 one would 
obtain an equation involving fourth derivatives of the metric tensor. 
This would seem objectionable, as all other equations of physics are 
first or second order. If the field equations were fourth order, it would 
be necessary to specify not only the initial values of the metric and its 
first derivatives, but also the second and third derivatives, in order to 
determine the evolution of the metric. 

We shall assume the field equations do not involve derivatives of 
the metric higher than the second. If these field equations are derived 
from a Lagrangian, then the action must have the form (3.16). One 
could however obtain a system of equations other than the Einstein 
equations, if one restricted the form of the variations Ag,, for which 
the action was required to be stationary. 

For example, one could restrict the metric to be conformal to a flat 


metric, 1.e. assume ‘ 
Jab = Q Nab: 
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where 7,,, is a flat metric as in Special Relativity. Then 
Agay = 20-7A.0G ay 
and the action will be stationary if 
(AGR—A)g® — R®) + TAQ, = 0 
for all AQ, that is if REA? Ay 
From (2.30), 
R= —60-0,,,.9% = — 60710, ,,.9% + 120-0, , 2, 49%, 


where | denotes covariant differentiation with respect to the flat 
metric 7,,. If the metric is static, Q will be constant along the integral 
curves of the Killing vector K (it will be independent of the time £). 
The magnitude of K will be proportional to Q. Therefore 


Fal? + VEV)f-* = Q, apg? + VV) O74 
= —4R+207Q,,.0.,9%—0-10,,V4,, V? 


= —GR+ffiabho9”- 

Thus the Laplacian of f will be equal to — 4 plus a term proportional 
to the square of the gradient of f. This last term may be neglected in 
a weak field. From the field equations, —4 will be equal to 
44-17 —2A. For a perfect fluid, T = —y+ 3p. One will therefore get 
agreement with Newtonian theory if A issmall or zeroand A~! = — 247. 

This theory in which the metric is restricted to be conformally flat 
is known as the Nordstrém theory. It can be reformulated as a theory 
in which the metric is the flat metric y and in which the gravitational 
interaction is represented by an additional scalar field ¢. As men- 
tioned before, this sort of theory would be inconsistent with the 
observed deflection of light by massive objects, and it would not 
account for the measured advance of the perihelion of Mercury. 

One could in fact obtain the observed deflection of light and the 
advance of the perihelion of Mercury if the metric was restricted to be 


of the form 
Jab = 2? (Hap a5 W, W,); 


where W, is an arbitrary one-form field. This would give the Newtonian 
limit in a static metric in which W, was parallel to the timelike Killing 
vector. There could however also be other static metrics where W, was 
not parallel to the Killing vector and these would not give the 
Newtonian limit. Further this restriction on the form of the metric 
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seems rather artificial. It appears more natural not to restrict the 
metric, apart from requiring that it be Lorentzian. 
We therefore adopt as our third postulate, 


Postulate (c): Field equations 


Einstein’s field equations (3.15) hold on -#. 

The predictions of these field equations agree, within the experimen- 
tal errors, with the observations that have been made so far on the 
deflection of light and the advance of the perihelion of Mercury, 
though the question of whether there exists a long range scalar field 
which ought to be included in the energy-momentum tensor remains 
open at the present time. 


4 


The physical significance of curvature 


In this chapter we consider the effect of space-time curvature on 
families of timelike and null curves. These could represent flow lines 
of fluids or the histories of photons. In §4.1 and § 4.2 we derive the 
formulae for the rate of change of vorticity, shear and expansion of 
such families of curves; the equation for the rate of change of expan- 
sion (Raychaudhuri’s equation) plays a central role in the proofs of 
the singularity theorems of chapter 8. In § 4.3 we discuss the general 
inequalities on the energy-momentum tensor which imply that the 
gravitational effect of matter is always to tend to cause convergence of 
timelike and of null curves. A consequence of these energy conditions 
is, as is seen in §4.4, that conjugate or focal points will occur in families 
of non-rotating timelike or null geodesics in general space—times. In 
§4.5 it is shown that the existence of conjugate points implies the 
existence of variations of curves between two points which take a null 
geodesic into a timelike curve, or a timelike geodesic ittto a longer 
timelike curve. 


4.1 Timelike curves 


In chapter 3 we saw that if the metric was static there was a relation 
between the magnitude of the timelike Killing vector and the 
Newtonian potential. One was able to tell whether a body was in a 
gravitational field by whether, if released from rest, it would accelerate 
with respect to the static frame defined by the Killing vector. However, 
in general, space-time will not have any Killing vectors. Thus one will 
not have any special frame against which to measure acceleration; the 
best one can do is to take two bodies close together and measure their 
relative acceleration. This will enable one to measure the gradient of 
the gravitational field. If one thinks of the metric as being analogous 
to the Newtonian potential, the gradient of the Newtonian field would 
correspond to the second derivatives of the metric. These are described 
by the Riemann tensor. Thus one would expect that the relative 
[78] 
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acceleration of two neighbouring bodies would be related to some 
components of the Riemann tensor. 

In order to investigate this relation more precisely we shall examine 
the behaviour of a congruence of timelike curves with timelike unit 
tangent vector V (g(V, V) = —1). These curves could represent the 
histories of small test particles, in which case they would be geodesics, 
or they might represent the flow lines of a fluid. If this were a perfect 


fluid, then by (3.10) (u+p) V4 =—p.,he (4.1) 


where V*= V2.,V® is the acceleration of the flow lines and 
h*, = 6%,+ VV, is the tensor which projects a vector Xe 7, into its 
component in the subspace H, of T, orthogonal to V. One may also 
think of h,, as the metric in H, (cf. §2.7). 

Suppose A(f) is a curve with tangent vector Z = (0/ét),. Then one 
may construct a family A(¢,s) of curves by moving each point of the 
curve A(é) a distance s along the integral curves of V. If one now defines 
Z as (0/0t),¢, 9 it follows from the definition of the Lie derivative (see 
§2.4) that LyZ = 0 or in other words that 


D 
a a b 
5 2 = V4,,2. (4.2) 


One may interpret Z as representing the separation of points equal 
distances from some arbitrary initial points along two neighbouring 
curves. If one adds a multiple of V to Z then this vector will represent 
the separation of points on the same two curves but at different 
distances along the curves. It is really only the separation of neigh- 
bouring curves that one is interested in, not the separation of particu- 
lar points on these curves. One is thus concerned only with Z modulo 
a component parallel to V, i.e. only with the projection of Z at each 
point q into the space Q, consisting of equivalence classes of vectors 
which differ only by addition of a multiple of V. This space can be 
represented as the subspace H, of T, consisting of vectors orthogonal 
to V. The projection of Z into H, will be denoted by , 2? = h4, Z°. In 
the case of a fluid one can regard ,Z as the distance between two 
neighbouring particles of the fluid as measured in their rest frame. 
From (4.2) it follows that 


D 


1.3, (14") = V*,, 2°. (4.3) 


This gives the rate of change of the separation of two infinitesimally 


80 PHYSICAL SIGNIFICANCE OF CURVATURE [4.1 


neighbouring curves as measured in H,. Operating again with D/és 
and projecting into H,, one finds 


bas Cogt 


he 19. ( D zs) es h?,(V? ae eVat Ves, Vtg Vez 
+ V°,.VeVe. .Z,V4+ V°.h°,Z¢,q V4). 


Changing the order of the derivatives in the first term and using (4.2), 
this reduces to 


hiya, Mea, 


D ( b D 2) = — R4,,4 ZeVeVe+ ht, V?. ot Vy, 2. (4.4) 
This equation, known as the deviation or Jacobi equation, gives the 
relative acceleration, i.e. the second time derivative of the separation, 
of two infinitesimally neighbouring curves as measured in H,. We 
see that this depends only on the Riemann tensor if the curves are 
geodesics. 

In Newtonian theory, the acceleration of each particle is given by 
the gradient of the potential ® and therefore the relative acceleration 
of two particles with separation Z¢ is ®,,,Z°. Thus the Riemann 
tensor term F,,,4 V°V4 is analogous to the Newtonian 9. ,,. The effect 
of this ‘tidal force’ term can be seen, for example, by considering a 
sphere of particles freely falling towards the earth. Each particle 
moves on a straight line through the centre of the earth but those 
nearer the earth fall faster than those further away. This means that 
the sphere does not remain a sphere but is distorted into an ellipsoid 
with the same volume. 

In order to investigate the deviation equation further we shall 
introduce dual orthonormal bases E,, E,, E;, E, and E!, E2, E?, E} of 
T, and T*, at some point g on an integral curve y(s) of V, with E* = V. 
One would like to propagate them along y(s) to obtain similar such 
bases at each point of y(s). However, if one parallelly propagates them 
along y(s) (i.e. so that D/és of each vector is zero) E, will not remain 
equal to V, and Ej, E,, E; will not remain orthogonal to V, unless 
y(s) 1s a geodesic. We therefore introduce a new derivative along 


y(s) called the Fermi derivative D,/és. This is defined for a vector 
field X along y(s) by: 
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It has the properties: 
oe DD. : : 
(i) a = aif y(s) is a geodesic; 
(i) Pe® <0; 
(ili) if X and Y are vector fields along y(s) such that 
DyX _ 2 DpY 
a. ss os? 


then g(X, Y) is constant along y(s); 
(iv) if X is a vector field along y(s) orthogonal to V then 
Dax (7 =) 
‘ x 


bs) Noda, 


(This last property shows that the Fermi derivative is a natural 
generalization of the derivative D/és.) 

Thus, if one propagates an orthonormal basis of T, along y(s) so that 
the Fermi derivative of each basis vector is zero, one obtains an 
orthonormal basis at each point of y(s), with E, = V. The vectors 
E,, E,, E, may be interpreted as giving a non-rotating set of axes 
along y(s). These could be realized physically by small gyroscopes 
pointing in the direction of each vector. 

The definition of the Fermi derivative along y(s) can be extended 
from vector fields to arbitrary tensor fields by the usual rules: 

(i) D,/és is a linear mapping of tensor fields of type (r, s) along y(s) 
to tensor fields of type (r,s), which commutes with contractions; 


(ii) = (K@L) = —— @L+K@—_; 
Drf _ df ; 
(iii) oa See where f is a function. 


From these rules it follows that the dual basis E!, E*, E*, E* of T*, is 
also Fermi-propagated along y(s). Using Fermi derivatives, (4.3) and 
(4.4) may be written as: 


D . 

FE Dt = V4, 2, (4.5) 
D*, a a cpoVasrpe Yo. Zes+ poy, Ze 4.6 
552 14 = — R44 2° V°Ve+ b seb + bL . (4.6) 


One may express these equations in terms of the Fermi-propagated 
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dual bases. As .Z is orthogonal to V it will have components with 
respect to E,, E,, E; only. Thus it may be expressed as Z*E, where we 
adopt the convention that Greek indices take the values 1, 2, 3 only. 
Then (4.5) and (4.6) can be written in terms of ordinary derivatives: 


d 

— J+ = V*,, ZF, 4,7 
ae Vp ) 
d* x 

qa” = (— Bat Va, + V2V;) ZF (4.8) 
where V*,, are the components of V7,, for which a = aandb = f. As 
the components Z* obey the first order linear ordinary differential 
equation (4.7), they can be expressed in terms of their values at some 


Bong ee ZH(s) = Agg(8) Z| q (4.9) 
where A, ,(s) isa 3 x 3 matrix which is the unit matrix at q and satisfies 
d 
qa deal) = VA, p(s). (4.10) 


In the case of a fluid the matrix A,, can be regarded as representing the 
shape and orientation of a small element of fluid which is spherical at g. 
This matrix can be written as 


Ap = O.3 83, (4.11) 


where O,, is an orthogonal matrix with positive determinant and S,, 
is a symmetric matrix. These will both be chosen to be the unit matrix 
atq. The matrix O,, may be thought of as representing therotation that 
neighbouring curves have undergone with respect to the Fermi- 
propagated basis while S,, represents the separation of these curves 
from y(s). The determinant of S,,, which equals the determinant of 
A,,;, may be thought of as representing the three-volume of the 
element of the surface orthogonal to y(s) marked out by the neigh- 
bouring curves. 

At q where A,, is the unit matrix, dO,,/ds is antisymmetric and 
dS, ,/ds is symmetric. Thus the rate of rotation of neighbouring curves 
at gis given by the antisymmetric part of J), , while the rate of change 
of their separation from y(s) is given by the symmetric part of V,, B 
and the rate of change of volume is given by the trace of J,. zg We 
therefore define the vorticity tensor as 


Way = h,°h,? Via (4.12) 
the expansion tensor as 
bap = hf hy? Vee; a)s (4.13) 
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and the volume expansion as 
ae — gk Va (4.14) 
We further define the shear tensor as the trace free part of 0,,, 
Tar = Pan — ghar, (4.15) 
and the vorticity vector as 
wt = AyPAV, co5q = fy®OV, Ty, 9 (4.16) 
The covariant derivative of the vector V may be expressed in terms of 
these quantities; Peo oer er nV (4.17) 


This decomposition of the gradient of the fluid velocity vector is 
directly analogous to that in Newtonian hydrodynamics. 
In the Fermi-propagated orthonormal basis the vorticity and 


expansion can be expressed in terms of the matrix A, , and its inverse 
—1 es 
AT 9? 


d 
Oa2 = = ya gq, A ay (4.18) 
A 4.19 
Op =A ve ds Ay? ( -19) 
d 
0 = (det A)-1— (det A). (4.20) 


ds 
From the deviation equation (4.8) it follows that 


d2 


qen tee = (—Ryayat Va; >+VV,) Ayp- (4.21) 


This equation enables one to calculate the propagation of the vorticity, 
shear and expansion along the integral curves of V if one knows the 
Riemann tensor. 
Multiplying by A-',, and taking the antisymmetric part, one 
obtains . d : 
Fg lak = Wye pry + Vias pi (4.22) 
Thus the propagation of vorticity depends on the antisymmetric 
gradient of the acceleration but not the ‘tidal force’. Another form of 
the above equation is 


d 
qs (Ava 8 Asp) 7 A, aVy; Aap: ; (4.23) 
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Therefore A,,0,;A gis a constant matrix if the curves are geodesics; 
in particular, if the curves are geodesics and the vorticity vanishes at 
one point on a curve, it will vanish at all points on the curve. If the 
curves are the flow lines of a perfect fluid it follows from (4.1) that 


1 dp 
Teas = ype as 


If the fluid is isentropic, this implies the conservation law: 


WA, Wy3Agg = constant, (4.24) 
d: 
where log W = ——— 
+p 


This conservation law is the relativistic form of the Newtonian 
vorticity conservation law. In the geodesic or pressure-free case, this 
takes the usual form that the magnitude of the vorticity vector is 
inversely proportional to the area of a cross-section orthogonal to the 
vorticity vector of an element of the fluid. When the pressure is non- 
zero, there is an extra relativistic effect arising from the fact that 
compression of the fluid does work on the fluid and therefore increases 
the mass and so the inertia of an element of the fluid (cf. (3.9)). This 
means that the vorticity of a fluid increases less under campression 
than would otherwise be expected. 
Multiplying (4.21) by A-!,, and taking the symmetric part, one 
finds A 
qs ae = — Ryapa— Ozy yp — 92y9 yp + Vas p+ Ve Vp- (4.25) 
(This equation and (4.23) can be expressed in terms of a general, non- 
orthonormal, non-Fermi-propagated basis by replacing the ordinary 
derivatives with Fermi derivatives and projecting everything into the 
subspace orthogonal to V.) 
The trace of (4.25) is 
do__p V2V> + 2w? — A? 2 a 
is a w®— 20°16? 4 V2. ,, (4.26) 
where 2u" = w,0% > 0 


This equation, which was discovered by Landau and independently by 
Raychaudhuri, will be of great importance later. From it one sees that 
vorticity induces expansion as might be expected by analogy with 
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centrifugal force while shear induces contraction. By the field equa- 
tions, the term R,, V¢V° = 4n(4¢+ 3p) for a perfect fluid whose flow 
lines have tangent vectors V*. Thus one would expect this term also 
to induce contraction. We shall give a general discussion of the sign 
of this term in §4.3. 

The trace-free part of (4.25) is 


Dy 
os 
— 200 gy thao hy? Ve: a — Hqn(2w? — 262+ V2, +4Ro gh), (4.27) 


_ 7d 1 ad 
Tap = — Cua [eel +$h,° h, ie ye an 


where C,,,.q 1s the Weyl tensor. Since this tensor is trace-free it does not 
enter directly in the expansion equation (4.26). However since the 
term — 207 occurs on the right of the expansion equation, the Weyl 
tensor produces convergence indirectly by inducing shear. The 
Riemann tensor can be expressed in terms of the Wey! tensor and the 
Ricci tensor: 


Rarca = Cavca— Gata ys — Gore a a- S29 agIJarv- 
The Ricci tensor is given by the Einstein equations: 
Ray 39a Rh + AGay = 87T ap. 
Thus the Weyl tensor is that part of the curvature which is not deter- 


mined locally by the matter distribution. However it cannot be 
entirely arbitrary as the Riemann tensor must satisfy the Bianchi 


identities: 
R artea: a= 9 
These can be rewritten as 
Ces, = Jabe, (4.28) 
where jabe — Rela; db) a dg? R: a) (4.29) 
These equations are rather similar to Maxwell’s equations in electro- 
dynamics: Fo, = Je, 


where F% is the electromagnetic field tensor and J® is the source 
current. Thus in a sense one could regard the Bianchi identities (4.28) 
as field equations for the Weyl tensor giving that part of the curvature 
at a point that depends on the matter distribution at other points. 
(This approach has been used to analyse the behaviour of gravitational 
radiation in papers by Newman and Penrose (1962), Newman and 
Unti (1962) and Hawking (1966a).) 


4 HLS 
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4.2 Null curves 


The Riemann tensor will affect the rate of change of separation of null 
curves as well as that of timelike curves. For simplicity, we shall con- 
sider only null geodesics. These could represent the histories of 
photons; the effect of the Riemann tensor will be to distort or focus 
small bundles of light rays. 

To investigate this, we consider the deviation equation for a congru- 
ence of null geodesics with tangent vector K (g(K, K) = 0). There are 
two important differences between this case and that of the timelike 
curves considered in the previous section. First, one could normalize 
the tangent vector V to the timelike curves by requiring g(V, V) = — 
In effect this means that one parametrized the curves by the arc- 
length s. However this is clearly impossible with null curves as they 
have zero are-lengths. The best one can do is to choose an affine 
parameter v; then the tangent vector K will obey 


D 
— = b == 
= Kt = K*,,K? = 0. 


However one could multiply v by a function f which was constant 
along each curve. Then fv would be another affine parameter and the 
corresponding tangent vector would be f-1K. Thus, given the curves as 
point sets in the manifold, the tangent vector is only really unique up 
to a constant factor along each curve. The second difference is that 
Q,, the quotient of 7, by K, isnot now isomorphic to H,, the subspace of 
T’, orthogonal to K, since H, includes the vector K itself as g(K, K) = 0. 
In fact as will be shown below, one is not really interested in the whole 
of Q, but only in the subspace S, consisting of equivalence classes of 
vectors in H, which differ only by a multiple of K. In the case of light 
rays, one can regard an element of S, as representing the separation 
between two neighbouring light rays which were emitted at the same 
time by a source. 

As before we introduce dual bases E,, E,, E;, E,, and E!, E2, E3, E4 
of 7, and T¥ at some point q on a curve y(v). However we will not 
chon shew to be orthonormal. We take E, equal to K, E, to be some 
other null vector L having unit negative scalar sited with E, 
(9(Es, E3,) = 0, g(E3,E,) =—1) and E, and E, to be unit spacelike 
vectors, orthogonal to each other and to E, wai By 


(g(E,, E,) = g(E., E,) = il. g(E,, E,) => g(E,, E,) — g(E,, E,) e— 0, etc.). 
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Note that because of the non-orthonormal character of the basis, the 
form E% is in fact equal to the form — Kg, and Eis — Lg,,. It can 
be seen that E,, E, and E, constitute a basis for H,, while the projec- 
tions into Q, of E,, E, and E, form a basis of Q,, and the projections of 
E, and E, form a basis of S,. We shall normally not distinguish between 
a vector Z and its projection into Q, or S,. We shall call a basis having 
the properties of E,, E,, E,, E,, above, pseudo-orthonormal. By 
parallelly transporting them along the geodesic y(v) one obtains a 
pseudo-orthonormal basis at each point of y(v). 

We use this basis to analyse the deviation equation for null geo- 
desics. If Z is the vector representing the separation of corresponding 
points on neighbouring curves, one has, as before: 


OEY 
so Ds. = Ke Ze (4 30) 
dv e 
D2 
and 5 04 = — Byeq ZK K?, (4.31) 


In the pseudo-orthonormal basis K?., will be zero as K is geodesic. 
Therefore one can express the 1, 2 and 3 components of (4.30) as a 
system of ordinary differential equations: 


d 

7 = K+, , 2, 

where as before Greek indices take the values 1, 2, 3. This shows that 
the projection of Z into the space Q, obeys a propagation equation 
which involves only this projection, and not the component of Z 
parallel to K. Further K?., = 0 since (K%g,,K°)., = 0. This implies 
that Z° = —Z*K,, is constant along the geodesic y(v). This can be 
interpreted as saying that light rays emitted from the same source at 
different times maintain a constant separation in time. As this is the 
case, one is more interested in the behaviour of neighbouring null 
geodesics which have purely spatial separations, i.e. one is interested 
in vectors Z for which Z? = 0. The projections of such vectors will 


then lie in the subspace S, and will obey the equation 


d 

ee KG", 

dv i 

where m, n take the values 1, 2 only. This is similar to (4.7) for the 
timelike case, except that now one is dealing only with a two- 
dimensional space of connecting vectors Z. 


4-2 
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As in the previous section, one can express Z” in terms of their 


values at some point q: a 
Z™(v) = Ann(0) 2"lq» 


where A,,,,(v) is a 2x 2 matrix which satisfies 


S Bun?) = Kip Ani) (4.32) 
d? « a 
py? Amnl”) a — Papa Apnl?)- (4.33) 


As before we call the antisymmetric part of K,,., the vorticity Onn, 
the symmetric part the rate of separation 6... and the trace the 
expansion 0. We also define the shear G,,,, a8 the trace-free part of 0,,n. 
They obey similar equations to the analogous quantities in the 
previous section: 


Oe = —O0mns (4.34) 

dg KR, K2K? + 262 262-46? 4.35 

[oe ab + 20% — 20° — 30", (4.35) 
d aA a 

qvom er; CMe (4.36) 


Equation (4.35) is the analogue of the Raychaudhuri equation for 
timelike geodesics. One sees again that vorticity causes expansion 
while shear causes contraction. We shall show in the next section that 
the Ricci tensor term — R,,K7K° will normally be negative, and so 
cause focussing. As before the Weyl tensor does not affect the expan- 
sion directly but causes distortion which in turn causes contraction 
(cf. Penrose (1966)). 


4.3 Energy conditions 


In the actual universe the energy-momentum tensor will be made up 
of contributions from a large number of different matter fields. It 
would therefore be impossibly complicated to describe the exact 
energy-momentum tensor even if one knew the precise form of the 
contribution of each field and the equations of motion governing it. 
In fact, one has little idea of the behaviour of matter under extreme 
conditions of density and pressure. Thus it might seem that one has 
little hope of predicting the occurrence of singularities in the universe 
from the Einstein equations as one does not know the right-hand side 
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of the equations. However there are certain inequalities which it is 
physically reasonable to assume for the energy-momentum tensor. 
These will be discussed in this section. It turns out that in many 
circumstances these are sufficient to prove the occurrence of singu- 
larities, independent of the exact form of the energy-momentum 
tensor. 

The first of these inequalities is: 


The weak energy condition 


The energy-momentum tensor at each pe.W obeys the inequality 
T., W2W® > 0 for any timelike vector We T,. By continuity this will 
then also be true for any null vector We 7,. 

To an observer whose world-line at p has unit tangent vector V, the 
local energy density appears to be 7, V*V°. Thus this assumption is 
equivalent to saying that the energy density as measured by any 
observer is non-negative. This would seem very reasonable physically. 
To investigate further the significance of this assumption we use the 
fact that one may express the components T@ of the energy— 
momentum tensor at p with respect to an orthonormal basis Ej, Eg, 
E,, E,, (E, timelike) in one of four canonical forms. 


Type I. Pi 

0 

‘ Pe 
ab — 
Ps 
0 
Le 

This is the general case in which the energy-momentum tensor has a 
timelike eigenvector E,. This eigenvector is unique unless u= — p, 


(a = 1,2,3). The eigenvalue yw represents the energy—density as 
measured by an observer whose world-line at p has unit tangent 
vector E, and the eigenvalues p, (a = 1, 2,3) represent the principal 
pressures in the three spacelike directions E,. This is the form of the 
energy-momentum for all observed fields with non-zero rest mass and 
also for all zero rest mass fields except in special cases when it is type IT. 


Type IT. p, 9 
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This is the special case in which the energy-momentum tensor has 
a double null eigenvector (E,+ E,). The only observed occurrence of 
this form is for zero rest-mass fields when they represent radiation all 
of which is travelling in the direction E,+ E,. In this case pj, pp and kK 
are zero. 


Type III. 


p 0 0 90 

Te 0 -—v 1 il 
~1o i-v oO 

0 1 0. »p 


This is the special case in which the energy-momentum tensor has 
a triple null eigenvector (E,-+E,). There are no observed fields which 
have energy-momentum tensors of this form. 


Type lV. 


V 0 


This is the general case in which the energy-momentum tensor has no 
timelike or null eigenvector. There are no observed fields which have 
energy-momentum tensors of this form. 

For type I, the weak energy condition will hold if u > 0, 4+p, > 0 
(a = 1, 2,3). For type IT it will hold if p, > 0, p, > 0,« > 0,vy=+41. 
These inequalities are very reasonable requirements and are satisfied 
by all experimentally detected fields. The condition will not hold for 
the physically unrealized types III and IV. 

The condition will also hold for the scalar field é postulated by Brans 
and Dicke and by Dicke (see Dicke (1964)). This field is required to be 
positive everywhere. It has an energy-momentum tensor of the form 
(3.6) where now m = 0. The energy-tensor of the other fields is ¢ times 
what it would have been had the scalar field not existed. 

The condition will not hold for the ‘C’-field proposed by Hoyle and 
Narlikar (1963). This again is a scalar field with m zero, only this time 
the energy-momentum tensor has the opposite sign and so the energy 
density is negative. This allows the simultaneous creation of quanta of 
positive energy fields and of the negative energy C-field. This process 
occurs in the steady-state model of the universe suggested by Hoyle 
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and Narlikar in which, as particles move apart due to the general 
expansion of the universe, new matter is continually being created to 
keep the average density constant. There is, however, a quantum 
mechanical difficulty associated with such a process. For even if the 
cross-section for the process were very small, the infinite phase space 
available to the positive and negative energy quanta would seem to 
result in an infinite number of such pairs being produced in a finite 
region of space-time. 

Such a catastrophe could not occur if the weak energy condition 
held. If a slightly stronger condition holds then creation is impossible 
in the sense that space-time must remain empty if it is empty at one 
time and no matter comes in from infinity. Conversely, matter present 
at one time cannot disappear and so must be present at another time. 
The condition is 


The dominant energy condition 
For every timelike W,, TW, W, > 0, and T®W, is a non-spacelike 
vector. 

This may be interpreted as saying that to any observer the local 
energy density appears non-negative and the local energy flow vector 
is non-spacelike. An equivalent statement is that in any orthonormal 
basis the energy dominates the other components of T',,, i.e. 


ES Te" foreach a, 6: 


This holds for type I if w~20, —w<p, <pw(a=1,2,3) and for 
types) if p==- ek > 0, 0 <p, < «(i = 1,2). In other words, the 
dominant energy condition is the weak energy condition with the 
additional requirement that the pressure should not exceed the energy 
density. This holds for all known forms of matter and there is in fact 
good reason for believing that this should be the case in all situations. 
For the speed of sound waves travelling in the E, direction is dp,/du 
(adiabatic) times the speed of light. Thus dp,/dw must. be less than or 
equal to one, as by postulate (a) in § 3.2 no signal can propagate faster 
than light. It follows that p, < uw, since, for every known form of 
matter, the pressures are small when the density is small. (Bludman 
and Ruderman (1968, 1970) have shown that there might be fields for 
which mass renormalization could lead to pressure being greater than 
the density. We feel, however, that this probably indicates a failure of 
renormalization theory rather than that such a situation would occur.) 
Now consider the situation depicted in figure 9 in which there is a C? 
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Surfaces {¢ = constant} 


4. t increases 


ve (0% )o 


(0%), 
Ficure 9. A compact region Y of space-time with past and future non-timelike 


boundaries (@Y),, (@Y), and timelike boundary (éY),. The part of YW lying to 
the past of the surface #(t’) (defined by ¢ = ¢’) is Y(t’). 


function ¢ whose gradient is everywhere timelike. (It will be shown in 
§ 6.4 that such a function will exist provided space-time is not on the 
verge of violating causality.) The boundary ¢Y of the compact region 
%&% consists of a part (¢%),, whose normal form n is non-spacelike and 
such that n,t.,g% is positive, a part (@Y), whose normal form n is non- 
spacelike and such that n,t.,g?? is negative, and a remaining part 
(0%), (which may be empty). The sign of the normal form n is given by 
the requirement that <n, X) be positive for all vectors X which point 
out of Y (cf. §2.8), H(t’) denotes the surface t = t’ and Y(t’) denotes 
the region of Y for which t < ¢’. For later use in § 7.4 we shall establish 
an inequality which holds not only for the energy-momentum tensor 
T but also for any symmetric tensor S® which satisfies the dominant 
energy condition. Applied to the energy-momentum tensor this 
inequality will show that 7° vanishes everywhere on Y if it vanishes 
on (¢%), and on the initial surface (0%). 


Lemma 4.3.1 


There is some positive constant P such that for any tensor S® which 
satisfies the dominant energy condition and vanishes on (@%),, 


H(t) & , (8%), ’ 


t t 
+ P| (| St, «do) dt’ +{ (| S*,.d95] clic 
HUYNH : HU)" ; 
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Consider the volume integral 


L(t) = i (59275 ))., de -{ St. dv +{ St ade 
Ut) Ut) 2 u(t) 29°; 


By Gauss’ theorem this can be transformed into an integral over the 
boundary of Y(t): 


The boundary of Y(t) will consist of Z(t) n aY and Wn H(t). Since 
S® ig zero on (€Y)s, 


I(t) = i + | =e i ; 
Ut) n (8%); Ut) a (OX). UA KH (t) 


By the dominant energy condition, S2%., is a non-spacelike vector 
such that St. ,t., > 0. As the normal form to (¢%), is non-spacelike 
and such that n,¢.,g% < 0, the second term on the right will be non- 
negative. Thus 


i Ber. do, <— | St a0; 
UO H(t) : Ut) 9 (8%), : 


ze I ay tie FS ata) A, 


Since Y is compact there will be some upper bound to the components 
of ¢, .) in any orthonormal basis whose timelike vector is in the direc- 
tion of t.,. Thus there will be some P > 0 such that on %, 


St. 4, < PSt, 4t., 


for any S® which obeys the dominant energy condition. The volume 
integral over Y(t) can be decomposed into a surface integral over 
HK (t') 0 Y followed by an integral with respect to ¢’: 


. ft 
i (PS"rt., +87 41.) de =| (! (PSt,,+8%,,)de,| di 
H(t) ae i (tN ; 


where do, is the surface element of #(t’). Thus 
i S%, do, <— i St, do, 
H(t) 04 ; Ut) 9 (0H), 


t t 
+P{ ({ St, doy) at's { (| Ss, dv) oy, (a 
HUN , HUY 


As an immediate consequence of this result one has: 
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The conservation theorem 

If the energy-momentum tensor obeys the dominant energy condition 
and is zero on (2%), and on the initial surface (2Y),, then it is zero 
everywhere on %. 


Let ab) — Ty tp, de 
A) ns 


t 
_ | ( | To) dt’ > 0. 
HUY 


Then the above lemma gives dz/dt < Px. But for sufficiently early 
values of ¢, #(é) will not intersect Y and so x will vanish. Thus x will 
vanish for all ¢ which implies that 7 is zero on Y. O 


From the conservation theorem it follows that if the energy- 
momentum tensor vanishes on a set , then it also vanishes on the 


Fieure 10. The future Cauchy development D+(S) of a spacelike set F. 


future Cauchy development D+(S), which is defined as the set of all 
points through which every past-directed non-spacelike curve inter- 
sects / (figure 10) (cf. § 6.5). For if qis any point of D+(.), the region 
of D+(S) to the past of g is compact (proposition 6.6.6) and may be 
taken as Y. This result may be interpreted as saying that the 
dominant energy condition implies that matter cannot travel faster 
than light. 

For our consideration of singularities, the importance of the weak 
energy condition is that it implies that matter always has a converging 
(or more strictly nondiverging) effect on congruences of null geodesics. 
If the vorticity vanishes, the expansion 6 obeys the equation: 


d 
p09 = ~ Pan K*K? ~ 262 — 462, 
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Thus in this case 0 will monotonically decrease along the null geodesic 
if Ra, W°W° > 0 for any null vector W. We shall call this the null 
convergence condition. From the Einstein equations, 


Rap = 29aph Se AGap on 87T.,; 


it follows that this condition is implied by the weak energy condition, 
independent of the value of A. 

From (4.26) it can be seen that the expansion 6 of a timelike geodesic 
congruence with zero vorticity will monotonically decrease along a 
geodesicif R,, W2W? > 0 for any timelike vector W. We shall call this 
the tamelike convergence condition. By the Einstein equation, this condi- 
tion will be satisfied if the energy-momentum tensor obeys the 
inequality, 
T.,WeW > Wwew,(4r— A| 


This will hold for type I if 
il 
+p, 2 9, #+%p,—-F Azo 
and for type IT if 
pi KS OS Op, = 0 and Pit P.—pA > 0 


We shall say that the energy-momentum tensor satisfies the strong 
energy condition if it obeys the above inequality for A = 0. This is a 
stricter requirement than the weak energy condition but it is still 
physically reasonable for the total energy-momentum tensor. For the 
general case, type I, it would be violated only by a negative energy 
density or a large negative pressure (e.g. for a perfect fluid with density 
1gmcm-? it can only be violated if p < — 10 atmospheres). It holds 
for the electromagnetic field and for the scalar field with m zero (in 
particular, it holds for the scalar field of Brans and Dicke). For 
m non-zero, the energy-momentum tensor of a scalar field has the 


fi Seo 
ar (§ ) i = P.aP:0— 49 a0(P:cP a9? + m9?). 
Thus if W2 is a unit timelike vector 


1m? 
T.,, W2W—-41W, WT = (b.4W%)? 555 ¢? (4.37) 
which may be negative. However by the equation of the scalar field 


ae = $44.9”. 
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Inserting this in (4.37) and integrating over a region %, one obtains 


1 a ay 1 ab 
5] Go+2WW)$,.6:0d0-5 | | $9; n08? doy 


The first term will be non-negative since g@+2W*W® is a positive 
definite metric and the second term will be small compared to the first 
if the region Y is large compared to the wavelength h/m. For 7 mesons, 
which may be described classically by a scalar field with 
m = 6x 10-% 9m, this wavelength is 3 x 10-13cm. Thus although the 
energy-momentum tensor of 7 mesons may not satisfy the strong 
energy condition at every point, this should not affect the convergence 
of timelike geodesics over distances greater than 10-12cm. This might 
possibly lead to a breakdown of the singularity theorems in chapter 8 
when the radius of curvature of space-time becomes less than 10- cm 
but such a curvature would be so extreme that it might well count as 
a singularity (§10.2). 


4.4 Conjugate points 


In §4.1 we saw that the components of the vector which represented 
the separation between a curve y(s) and a neighbouring curve in a 
congruence of timelike geodesics, satisfied the Jacobi equation: 

d2 

qa” = —RayggZ? (a, 8 = 1, 2, 3). (4.38) 
A solution of this equation will be called a Jacobi field along y(s). Since 
a solution may be specified by giving the values of Z* and dZ*/ds at 
some point on y(s) there will be six independent Jacobi fields along 
y(s). There will be three independent Jacobi fields which vanish at 
some point g of y(s). They may be expressed as: 


d 
Z*(s) — A, 4(8) qe le 


d2 
where Gee taals) oo — RysysA,,(8); (4.39) 


and A,,(s) is a 3x 3 matrix which vanishes at q. These Jacobi fields 
may be thought of as representing the separation of neighbouring 
geodesics through q. As before one may define the vorticity, shear and 
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expansion of the Jacobi fields along y(s) which vanish at q: 


d 
Oup = AT Ady (4.40) 
—1 d 1 
Oxp =A WO qgAay— 36429, (4.41) 
d 
6 = (det A) (det A). (4.42) 


These will obey the equations derived in §4.1, with V, = 0. In particular 


1 d 
Ay. Wy3 Asp = 5 (4,54 y9- Ang 5 Aye) 
will be constant along y(s). But it vanishes at q where A,, is zero. 
Thus ,, will be zero wherever A,, is non-singular. 

We shall say that a point p on y(s) ts conjugate to gq along y(s) if there 
is a Jacobi field along y(s), not identically zero, which vanishes at 
g and p. One may think of p as a point where infinitesimally neigh- 
bouring geodesics through q intersect. (Note, however, that it may be 
only infinitesimally neighbouring geodesics which intersect at p; there 
need not be two distinct geodesics from qg passing through p.) The 
Jacobi fields along y(s) which vanish at g are described by the matrix 
A,,.Thusapoint pis conjugate to qalong y(s)ifand only if 4,, issingu- 
lar at p. The expansion @ is defined as (det A)~1d (det A)/ds. Since Ay, 
obeys (4.39) where R,,4,,4is finite, d (det A)/ds will be finite. Thus a point 
p will be conjugate to g along y(s) if @ becomes infinite there. The con- 
verse will also be true since 0 = dlog (det A)/ds and A,, can be singular 
only at isolated points or else it would be singular everywhere. 


Proposition 4.4.1 

If at some point y(s,) (s, > 0), the expansion @ has a negative value 
6, < 0 and if R,,V7V° > 0 everywhere then there will be a point 
conjugate to q along y(s) between y(s,) and y(s,+ (3/—9,)), provided 
that y(s) can be extended to this parameter value. (This may not be 
possible if space-time is geodesically incomplete. In chapter 8 we 
shall interpret such incompleteness as evidence of the existence of a 
singularity.) 

The expansion 6 of the matrix A,, obeys the Raychaudhuri equation 
(4.26): Al 


450 = — Ban VAV? — 20° 36" 


98 PHYSICAL SIGNIFICANCE OF CURVATURE [4.4 


where we have used the fact that the vorticity is zero. All the terms on 
the right-hand side are negative. Thus for s > 8 


2 oe 
~ 2 Gree) Se) 


So @ will become infinite and there will be a point conjugate to q for 
some value of s between s, and s,+(3/—6,). O 


In other words, if the timelike convergence condition holds and if the 
neighbouring geodesics from q start converging on 7(s), then some 
infinitesimally neighbouring geodesic will intersect y(s) providing that 
y(s) can be extended to large enough values of the parameter s. 


Proposition 4.4.2 

IfR,, V¢V? > Oandifatsome point p = y(s,) the tidal force R,,,q V°V4 
is non zero, there will be values s, and s, such that g = y(s9) and 
r = y(s,) will be conjugate along y(s), providing that y(s) can be 
extended to these values. 


A solution of (4.39) along y(s) is uniquely determined by the values of 
A,,and dA,,/ds at p. Consider the set P consisting of all such solutions 
for which A,,|,, = 5,,, (dA,,/ds)|, is symmetric with trace, O|,, < 0. 
For each solution in P there will be some s, > s, for which A.,(s3) is 
singular, since either 6|,, < 0, in which case this follows from the 
previous result, or 6|,, = 0, in which case (da, ,/ds)|,, is non-zero which 
will then cause o® to be positive and so cause 6 to become negative for 
8 > 8. The members of the set P are in one-one correspondence with 
the space S of all symmetric 3 x 3 matrices with non-positive trace 
(i.e. with the values of dA,,/ds)|,,). There is thus a map 7 from S 
to y(s) which assigns to each initial value (dA, ,/As)|,, the point on y(s) 
where A, , first becomes singular. The map 77 is continuous. Further if 
any component of (d4,,/ds)|, is very large, the corresponding point 
on y(s) will lie near p, since in the limit the term R, aya in (4.39) becomes 
irrelevant and the solution resembles the flat space case. Thus there is 
some C > Oandsome sy > s, such that if any component of (dd, ,/As)|, 
is greater than C, the corresponding point on y(s) will be before y(s4). 
However the subspace of § consisting of all matrices all of whose com- 
ponents are less than or equal to C, is compact. This shows that there 
is some s, > s, such that 7(S) is contained in the segment from y(s,) to 
(83). Consider now a point r = y(s,) where Sy > 8,. If there is no point 
conjugate to r between r and p, the Jacobi fields which are zero at r 
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must have an expansion 6 which is positive at p (otherwise they would 
be in the set P which represents all families of Jacobi fields with zero 
vorticity which have non-positive expansion at p). It follows from the 
previous result that there is then a point q = y(s 9) (8) < s,) whichis 
conjugate to r along y(s). O 


In a physically realistic solution (though not necessarily in an exact 
one with a high degree of symmetry), one would expect every timelike 
geodesic to encounter some matter or some gravitational radiation and 
so to contain some point where R,,.4 V° V4 was non-zero. Thus it would 
be reasonable to assume that in such a solution every timelike geodesic 
would contain pairs of conjugate points, provided that it could be 
extended sufficiently far in both directions. 

We shall also consider the congruence of timelike geodesics normal 
to a spacelike three-surface, #7. By a spacelike three-surface, #’, we 
mean an imbedded three-dimensional submanifold defined locally by 
f = 0 where fis a C? function and g”f.,f., < 0 when f = 0. We define 
N, the unit normal vector to #, by N* = (—g°f.,f.,)-29%4f. 4 and the 
second fundamental tensor x of # by YX w= h,°h,7N,.4, where 
har = Fan + N,N, 18 called the first fundamental tensor (or induced 
metric tensor) of # (cf. §2.7). It follows from the definition that x is 
symmetric. The congruence of timelike geodesics orthogonal to # will 
consist of the timelike geodesics whose unit tangent vector V equals 
the unit normal N at # Then one has: 


Vit — Xab at KH. (4.43) 


The vector Z which represents the separation of a neighbouring 
geodesic normal to # from a geodesic y(s) normal to #, will obey the 
Jacobi equation (4.38). At a point g on y(s) at # it will satisfy the 


initial condition: d 
a 2 4,44 
d 8 Z AG ap Z ( ) 
We shall express the Jacobi fields along y(s) which satisfy the above 
condition as Z2(s) = A,4(8) ZA|q, 
d2 
where Gates = —RyysAys (4.45) 


and at q, A,, is the unit matrix and 


d 
= Aap = Kay Arp: (4.46) 


100 PHYSICAL SIGNIFICANCE OF CURVATURE [4.4 


We shall say that a point p on y(s) ts conjugate to H along y(s) if there 
is a Jacobi field along y(s) not identically zero, which satisfies the 
initial conditions (4.44) at q and vanishes at p. In other words, p is 
conjugate to # along >(s) if and only if A,, 1s singular at p. One may 
think of p as being a point where neighbouring geodesics normal to # 
intersect. As before A,, will be singular where and only where the 
expansion @ becomes infinite. At q, the initial value of Ay, Ag, will 
be zero, therefore w,, will be zero on (s). The initial value of @ will 


be Xag”. 


Proposition 4.4.3 

If R,, V2V° > 0 and x,,9% < 0, there will be a point conjugate to H 
along y(s) within a distance 3/(—X..g%) from #, provided that y(s) 
can be extended that far. 


This may be proved using the Raychaudhuri equation (4.26) as in 
proposition 4.4.1. O 


We shall call a solution of the equation: 


ae 
ae = — RiaansZ” (m,n = iL. 2) 
along a null geodesic y(v), a Jacobi field along y(v). The cOmponents 
Z™ could be thought of as the components, with respect to the basis E, 
and E,, of a vector in the space S, at each point g. We shall say that 
p is conjugate to g along the null geodesic y(v) if there is a Jacobi field 
along y(v), not identically zero, which vanishes at q and p. If Z is 
a vector connecting neighbouring null geodesics which pass through q, 
the component Z° will be zero everywhere. Thus p can be thought of 
as a point where infinitesimally neighbouring geodesics through q 
intersect. Representing the Jacobi fields along y(v) which vanish at ¢ 
by the 2 x 2 matrix A, 
~ d 

Z™(v) a Amn Fp 

One has as before: A, Ox Ann = 0, 80 the vorticity of the Jacobi fields 


which are zero at p vanishes. Also p will be conjugate to g along y(v) 
if and only if 


ZA... 


6 = (det Aye (det A) 


becomes infinite at p. Analogous to proposition 4.4.1, we have: 
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Proposition 4.4.4 


If R,,K°K° > 0 everywhere and if at some point y(v,) the expansion 6 
has the negative value 6, < 0, then there will be a point conjugate 
to q along y(v) between y(v,) and y(v, + (2/—9,)) provided that y(v) 
can be extended that far. 


The expansion 6 of the matrix 4,,, obeys (4.35): 
on 
dv 


and so the proof proceeds as before. O 


6 = —R,, KK — 262-162, 


Proposition 4.4.5 

If R,»K°K° > 0 everywhere and if at p = y(v,), K°K*K,, Ry aK y is 
non-zero, there will be vp and v, such that q = y(v») and r = y(v,) will 
be conjugate along y(v) provided y(v) can be extended to these values. 


If K°K’K,, Ry) -aeK p) is non zero then so is B,,4,4. The proof is then 
similar to that of proposition 4.4.2. O 


As in the timelike case, this condition will be satisfied for a null 
geodesic which passes through some matter provided that the matter 
is not pure radiation (energy-momentum tensor type II of §4.3) and 
moving in the direction of the geodesic tangent vector K. It will be 
satisfied in empty space if the null geodesic contains some point where 
the Weyl tensor is non-zero and where K does not lie in one of the 
directions (there are at most four such directions) at that point for 
which K°K?K(,C,\,aeK = 0. It therefore seems reasonable to assume 
that in a physically realistic solution every timelike or null geodesic 
will contain a point at which K*K°K,, Rage,-Ky is not zero. We shall 
say that a space-time satisfying this condition satisfies the generic 
condition. 

Similarly we may also consider the null geodesics orthogonal to 
a spacelike two-surface Y. By a spacelike two-surface Y, we mean an 
imbedded two-dimensional submanifold defined locally by f, = 0, 
fo = 0 where f, and f, are C? functions such that when /, = 0, f, = 0 
then f,,, and f,.,, are non-vanishing and not parallel and 


(fi:at+ Hfe;a) (fio t+Hfe;v) 9” = 


for two distinct real values 1, and 4, of #4. Then any vector lying in the 
two-surface is necessarily spacelike. We shall define V7 and 4’, the 
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two null vectors normal to Y, as proportional to g”(f,.) +/4, fe.) and 
g9”"”(fizp +Hofo.,) respectively, and normalize them so that 


NN 2 Gap =—1. 


One can complete the pseudo-orthonormal basis by introducing two 
spacelike unit vectors Y,¢ and Y,? orthogonal to each other and to N,* 
and N,*. We define the two null second fundamental tensors of F as: 


nXab = — Nyesa(Vi° Via + Ye" Yea) (Y,7¥,, + Y,7Y2;); 


where n takes the values 1, 2. The tensors ,y,,, and o¥,,) are Symmetric. 

There will be two families of null geodesics normal to / corre- 
sponding to the two null normals N,? and N,*. Consider the family 
whose tangent vector K equals N, at Y. We may fix our pseudo- 
orthogonal basis E,, E,, E,, E, by taking E, = Y,, E, = Y,, E, = Nj, 
E, = N, at and parallelly propagating along the null geodesics. 
The projection into the space S, of the vector Z representing the 
separation of neighbouring null geodesics from the null geodesic y(v) 
will satisfy (4.30) and the initial conditions 


d 


Fe (4.47) 


= 
at q on y(v) at S. As before the vorticity of these fields will be zero. 
The initial value of the expansion @ will be ,y,,9%. Analogous to 
proposition 4.4.3 we have: 


Proposition 4.4.6 


If R,,K°K® > 0 everywhere and 4y,,9% is negative there will be a 
point conjugate to / along y(v) within an affine distance 2/(— egg”) 
from S. Oo 


From their definition, the existence of conjugate points implies the 
existence of self-intersections or caustics in families of geodesics. A 
further significance of conjugate points will be discussed in the next 
section. 


4.5 Variation of arc-length 


In this section we consider timelike and non-spacelike curves which 
are piecewise C* but which may have points at which their tangent 
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vector is discontinuous. We shall require that at such points the two 
tangent vectors 
)-- 
+ 


that is, they point into the same half of the null cone. 


0 é | @ 
al. and si, satisfy (5| >% 


Proposition 4.5.1 


Let Y be a convex normal coordinate neighbourhood about g. Then the 
points which can be reached from q by timelike (respectively non- 
spacelike) curves in Y are those of the form exp,(X), Xe7,, where 
g(X, X) < 0 (respectively < 0). (Here, and for the rest of this section, 
we consider the map exp to be restricted to the neighbourhood of the 
origin in J, which is diffeomorphic to Y under exp,.) 


In other words, the null geodesics from g form the boundary of the 
region in ¥ which can be reached from q by timelike or non-spacelike 
curves in Y. This is fairly obvious intuitively but because it is funda- 
mental to the concept of causality we shall prove it rigorously. We 
first establish the following lemma: 


Lemma 4.5.2 


In @ the timelike geodesics through g are orthogonal to the three- 
surfaces of constant a (a < 0) where the value of o at pe is defined 


to be g(exp, *p, exp, *p). 


The proof is based on the fact that the vector representing the separa- 
tion of points equal distances along neighbouring geodesics remains 
orthogonal to the geodesics if it is so initially. More precisely, let X(t) 
denote a curve in 7, where g(X(é), X(¢)) = — 1. One must show that 
the corresponding curves A(t) = exp,(S)X(t)) (89 constant) in Y%, where 
defined, are orthogonal to the timelike geodesics y(s) = exp,(sX(é))) 
(¢, constant). Thus in terms of the two-surface « defined by 
a(s,t) = exp,(sX(t)), one must prove that 


é é 
o((a).- (a) =" 
(see figure 11). Now 


a (a @\_ (222) 
as? \as’ at) 7 \@s as? at} ° 7 \ds’ Os dt)’ 
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Geodesic 


Surface ¢ = constant 
(¢ =—s*) 


Null Null cone 


cone 


Length | 
a 


Ficure 11. In a normal neighbourhood, surfaces at constant distance from q 
are orthogonal to the geodesics through gq. 


~ 


The first term on the right is zero as ¢/¢s is the unit tangent vector to 
the timelike geodesics from qg. In the second term one has from the 
definition of the Lie derivative that 


Th 0 -, 5) = ‘é@ De\ tea =, a)=0 

as7 \as? Ot. =9\5,. at as) 2 a9 \as’ ds) — 

Therefore 9(é/és, /et) is independent of s. But at s = 0, (@/ét), = 0. 
Thus g(@/és, @/ét) is identically zero. a) 


Proof of proposition 4.5.1. Let C, denote the set of all timelike vectors 
at q. These constitute the interior of a solid cone in T, with vertex at 
the origin. Let y(t) be a timelike curve in Y from q to p and let F(t) be 
the piecewise C? curve in 7, defined by 7(t) = exp, “(y(t)). Then 
identifying the tangent space to 7, with 7, itself, one has 


(0/é),|, = (0/ét);|,. 
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Therefore at q, (2/ét); will be timelike. This shows that the curve v(t) 
will enter the region C,. But exp, (C,) is the region of Y on which g is 
negative and in which by the previous lemma the surfaces of constant 
o are spacelike. Thus o must monotonically decrease along y(t) since 
(0/et), being timelike can never be tangent to the surfaces of constant o 
and since at any non-differentiable point of y(t) the two tangent 
vectors point into the same half of the null cone. Therefore pé exp,(C,) 
which completes the proof for timelike curves. To prove that a non- 
spacelike curve y(t) remains in exp, (C,), one performs a small varia- 
tion of y(¢) which makes it into a timelike curve. Let Y be a vector 
field on 7, such that in YW the induced vector field exp,«(Y) is every- 
where timelike and such that g(Y, (0/ét),,|q) < 0. For each € > 0 let 
A(r,é) be the curve 7, starting at the origin such that the tangent 
vector (0/ér), equals (0/ét)s|,-, +€Y| 4, .. Then A(r,e) depends differ- 
entiably on r and e. For each € > 0, exp, (A(r,€)) is a timelike curve 
in Y and so is contained in exp, (C,). Thus the non-spacelike curve 


exp, (A(r, 0)) = y(r) is contained in exp, (C,) = exp, (C,). oO 


Corollary 


If pe can be reached from g by a non-spacelike curve but not by a 
timelike curve, then p lies on a null geodesic from q. O 


The length of a non-spacelike curve y(t) from q to p is 


24> = —¥ Bp? Og ? 
REI | a at 


where the integral is taken over the differentiable sections of the curve. 

In a positive definite metric one may seek the shortest curve between 
two points but in a Lorentz metric there will not be any shortest curve 
as any curve can be deformed into a null curve which has zero length. 
However, in certain cases there will be a longest non-spacelike curve 
between two points or between a point and a spacelike three-surface. 
We deal first with the situation when the two points are close together. 
We shall then derive necessary conditions in the general case when the 
two points are not close. The sufficient condition in this case will be 
dealt with in §6.7. 


Proposition 4.5.3 

Let g and p lie in a convex normal neighbourhood @%. Then, if q and p 
can be joined by a non-spacelike curve in Y%, the longest such curve 
is the unique non-spacelike geodesic curve in Y from q to p. Moreover, 
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defining p(q,p) as the length of this curve if it exists, and as zero 
otherwise, p(q, p) is a continuous function on Y x %. 


By the definition of convex normal neighbourhoods (§2.5), there is 
a unique geodesic y(t) in Y with y(0) = g, y(1) = p. Since this geodesic 
depends differentiably on its endpoints, the function 


va) J9((4), (a),)# 


will be differentiable on WY x Y. (This function o is the same as that 
in lemma 4.5.2.) Thus p(q,p) will be continuous on Y%x W since it 
equals [—o(q, p)]? if o < 0 and is zero otherwise. It now remains to 
show that if g and p can be joined by a timelike curve in Y then the 
timelike geodesic y between them is the longest such curve. Let a(s, t) 
be exp, (sX(¢)) as before where g(X(t), X(¢)) = —1. If A(t) is a time- 
like curve in Y from q to p=, it can be represented as A(t) = x(f(t), t). 


a (2), -ro(2),+(8). 


Since the two vectors on the right are mutually orthogonal by lemma 
4.5.2, and since g((@/0s),, (@/@s),) = — 1, this gives 


a((5) -(5),)--vor+e((Z) (2) ) > -wor. 


the equality holding if and only if (2/ét), = 0 and hence if and only if 
A is a geodesic curve. Thus 


L(..4.) < iI "fat = pla), 


the equality holding if and only if A is the unique geodesic curve in Y 
from q to p. | 


We shall now consider the case where q and 7 are not necessarily 
contained in a convex normal neighbourhood ZY. By considering small 
variations we shall derive necessary conditions for a timelike curve 
y(t) from ¢ to p to be the longest such curve from q to p. A variation a 
of y(t) is a C!- map a: (—e,e) x [0, t,]—>-@ such that 

(1) a(0,t) = y(t); 

(2) there is a subdivision 0 = t, < t,... < t, = t, of [0,¢,] such that 
a is C3 on each (—e,€) x [t,, t4.4]5 

(3) a(u, 0) = q; a(u, t,) =P; 

(4) for each constant w, a(u,t) is a timelike curve. 
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The vector (8/éx),|,,-9 will be called the variation vector Z. Con- 
versely, given a continuous, piecewise C2 vector field Z along y(t) 
vanishing at q and p, we may define a variation a for which Z will 
be the variation vector by: 


a(u,t) = exp, (uZ|,), 
where ue¢(—e,e) for some € > 0 and r = y(E). 


Lemma 4.5.4 


The variation of the length from g to p under « is 


Bila” Sih, eel aa Ca)al) + Bo(Ge [Pa)). 


ou 
where f? = g(é/ét, é/¢t) is the magnitude of the tangent vector and 
[f—* ¢/ét] is the discontinuity at one of the singular points of y(t). 


We have: 
veo Paul -a(Ge a) # 
~-2fofB 8,2) 
22 ee 


=-2 liege a))-9(eaa) 


Integrating the first term by parts one has the required formula. . 0 


OL 
ou 


One may simplify the formula by choosing the parameter ¢ to be the 
arc-length s. Then g(¢/ét, é/¢t) = —1. We shall denote by V the unit 
tangent vector @/és. One has: 


oL 

ou 
where V = DV/@s is the acceleration. From this one sees again that a 
necessary condition for y(t) to be the longest curve from q to p is that 
it should be an unbroken geodesic curve as otherwise one could choose 
a variation which would yield a longer curve. 

One may also consider a timelike curve y(t) from a spacelike three- 
surface # to a point p. A variation @ of this curve is defined as before 
except that condition (3) is replaced by: 

(3) a(w, 0) lies on #, a(u,t,) = p. 

Thus at # the variation vector Z = @/éu lies in #. 


=I fi+1 


< nat 
== | 9(Z,V)ds+ 5 9(Z,(V)) 


u=0 i=l 
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Lemma 4.5.5 
al m-1 fis. , n-1 
Fahce 7 2, MU z)de+ S912, [V) +912, leno 
Ulu=0 i=llts i=2 
The proof is as for lemma 4.5.4. a 


From this one sees that a necessary condition for y(t) to be the longest 
curve from # to p is that it is an unbroken geodesic curve orthogonal 
to #. 

We have seen that, under a variation «, the first derivative of the 
length of a timelike geodesic curve is zero. To proceed further we shall 
calculate the second derivative. We define a two-parameter variation 
a of a geodesic curve y(t) from g to p as a C! map: 

a: (~€, €,) x (— €g, €9) x [0,£,] > 4 
such that 

(1) (0, 0,4) = y(t); 

(2) there is a subdivision 0 = t, < tg < ... < t, =t, of [0,¢,] such 
that a is C? on each 


(—€,,€;) x (— 6s, €3) x [¢,, t244]5 


(3) (Uy, Us, 0) a q; (Uy, Ua, ty) =D; 
(4) for all constant w,, ug, a(u,, Us, t) is a timelike curve. , 


We define 
a) 
| A fees 
: lee 
a 
| Se 
Eee 


as the two variation vectors. Conversely given two continuous, piece- 
wise C? vector fields Z, and Z, along y(t) one may define a variation 
for which they will be the variation vectors, by: 
(21, Ua, t) = exp, (u,Z,+UZ,), 
r= ¥(é). 
Lemma 4.5.6 


Under the two-parameter variation of the geodesic curve y(é), the 
second derivative of the length will be: 


aL 
OUy OUy 


U,=0 t=1 Jt; 


(sil deere D2 
ey i a(Z, [ager 01V 24) V) RCV, Z,)¥]) ds 


n—1 


+Z0(2 aq (ZataV, Za) v)]}. 
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By lemma 4.5.4, one has: 


mates” = [9am aa-t (aya a) *+24(5[A). 


= 
Therefore 
santa fess =) 9(5n,tmel'a aS (a) a) 
-2fo(a-|f*(2)aa-t maa 
-269(50) (a)att* (aca) a!*() aad) 
+29(2- ae If 5) +24 (a 2 |f “ll 


The first and third terms vanish as y(t) is an unbroken geodesic curve. 
In the second term one can write: 


DDé__p(@ @)@,DD a 
Ou, ot at Gt’ Guy) dt * ot Buy dt 
oe) Os 
i Gt? Guy) Ot Ot Gu, 
af a/(,4 (Da s) 
god saa al (in BH 


en offers a De 
=, Sara @ eh fie. ee 
at AG (sx) f Cae a) 


In the fourth term: 


fast ta| = [fa aa, ta aya) al 


Then taking ¢ to be the arc-length s, one obtains the required result. D 


Although it is not immediately obvious from the appearance of the 
expression, one knows from its definition that it is symmetric in the 
two variation vector fields Z, and Z,. One sees that it only depends on 
the projections of Z, and Z, into the space orthogonal to V. Thus we 
can confine our attention to variations « whose variation vectors are 
orthogonal to V. We shall define 7), to be the (infinite-dimensional) 
vector space consisting of all continuous, piecewise C* vector fields 
along y(t) orthogonal to V and vanishing at q and p. Then é?L/ou, du, 
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will be a symmetric map of 7, xT, to R'. One may think of it as 

a symmetric tensor on 7, and write it as: 
@L 

Oty OU, rao’ 


L(Z,,Z) = Tedaely 


One may also calculate the second derivative of the length from # 
to p of a geodesic curve y(t) normal to H#H. One proceeds as before 
except that one endpoint of y(¢) is allowed to vary over # instead of 
being fixed. 


Lemma 4.5.7 
The second derivative of the length of y(i) from # to pis: 
-1 Pts: 2 
ae | ‘ a (Zs [PeZ-—RWV,Z,)V]] ds 


u= 
OUgOUy|uy=0 i=l) cs 


n-1 D D 
3 (20 [z2s]) +0(275%), 


where Z, and Z, have been taken orthogonal to V and y(Zy, Zz) is the 
second fundamental tensor of #. 


— x(Zy, Zp) 


> 


The first two terms are as for lemma 4.5.6. The extra terms are: 
D 0 0 D @@ 
esi (Ae 2S las = f= 1 ee ee 
Dite? (= f My f a= ou,’ 5) ” 


fini( a tee 
IN Gu, Bt at) 7 \au,? Ble? 7 \Ouy? et Guy 

The second term vanishes as ¢/éu, is orthogonal to ¢/ét. If one takes 

t to be the arc-length s, then ¢/ct will be equal to the unit normal N 


at #. Since the endpoint of y(é) is restricted to varying over #, 0/0u, 
will always be orthogonal to N. Thus 


D @ é a a D a @ 
(3a 8) ~ an Gee §) - (gee ze, 9) ~—* (aap Sn) 
We shall say that a timelike geodesic curve y(t) from q tu p is maximal 
if L(Z,, Z,) is negative semi-definite. In other words, if y(t) is not 
maximal there is a small variation « which yields a longer curve from 
p to q. Similarly we shall say that a timelike geodesic curve from # 
to p normal to # is maximal if L(Z,, Z,) is negative semi-definite, 


so if y(t) is not maximal there is a small variation which yields a longer 
curve from # to p. 


x 


H 
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Proposition 4.5.8 


A timelike geodesic curve y(t) from gq to p is maximal if and only if 
there is no point conjugate to q along y(t) in (q, 7). 


Suppose there is no conjugate point in (q, p). Then introduce a Fermi- 
propagated orthonormal basis along y(t). The Jacobi fields along y(é) 
which vanish at q will be represented by a matrix A,,(¢) which will be 
non-singular in (g, p), but which will be singular at q and possibly at p. 
Since conjugate points are isolated, d(log det A)/ds will be infinite 
where A,, is singular. Thus a C®, piecewise C® vector field ZeT, 
can be expressed in [q, p] as 


ZX = Ax, Ws, 
where W? is C°, piecewise C1 on [q, p]. Then, 


OU IA = > ali Ay We | Ta W) + Rasya Aya we} as 


me ’ 
+24,,W? E (Avs a) 


Sp d d? 
2a eS 8 
= lim 5 | an lor = W°+A,; °as W |as 


: ro lene 
+ EA, WA, Ee W | 


8p d d 
aoe — We 78 B(— A, Ags 
zi’ {Aseqg Wee, W + Wels. n 


daa ae, 
~ Ay GeAas) SW |. 


(We take the limit because the second derivative of W® may not be 
defined at q.) But 


d d 


Therefore L(Z, Z) < 

Conversely, suppose setweie ji is a point rE (q, p) conjugate to g along 
y(t). Let W be the Jacobi field along y which vanishes at q and r. 
Let K €7, be such that 


Keg, — We = —1 at 1. 
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Extend W to p by putting it zero in [r, p]. Let Z be eK + €~*W, where 
€ is some constant. Then 


L(Z,Z) = e2L(K, K) + 2L(K, W) + 2e-2L(W, W) = e2L(K, K) + 2. 
Thus by taking € small enough, L(Z, Z) may be made positive. Oo 


One may obtain similar results for the case of a timelike geodesic 
curve y(t) orthogonal to #, from # to p. 


Proposition 4.5.9 
A timelike geodesic curve y(t) from # to p is maximal if and only if 
there is no point in (#, q) conjugate to # along y. O 


We shall also consider variations of a non-spacelike curve y(¢) from q 
to p. We shall be interested in the circumstances under which it is 
possible to find a variation « of y(t) which makes g(@/ét, 0/ét) negative 
everywhere, or in other words, yields a timelike curve from q to p. 
Under a variation a: 


a(,(2 ®\\_ (D2 2)\_» es 
Bu \9 \ a0 ae} ) ~ “Fou ae oe) ~ “Nae aw 


af.(a é@ @éDa\. 
= 9) al 
2a (1 (=. =) 9 (=. et 5): ieee) 


In order to obtain a timelike curve from q to p, one requires this to be 
less than or equal to zero everywhere on y(t). 


Proposition 4.5.10 


If p and q are joined by a non-spacelike curve y(t) which is not a null 
geodesic they can also be joined by a timelike curve. 


If y(t) is not a null geodesic curve from p to q, there must be some point 
at which the tangent vector is discontinuous, or there must be some 
open interval on which the acceleration vector (D/ét) (é/ét) is non-zero 


and not parallel to @/ét. Consider first the case where there are no 
discontinuities. One has 


fe é 4 _128/ (2a @ 

"Nae at at) = 3H \9 i) i 

This shows that (D/ét) (@/ét) is a spacelike vector where it is non-zero 
and not parallel to é/é. Let W be a C? timelike vector field along y(t) 
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such that g(W, d/ét) < 0. Then one will obtain a timelike curve from 
p to q under the variation whose variation vector is 


Da 
L= —— 
ed ra 
fe 
with i= canoe | e—2t(1 — dya?) dt, 
tg 
Do Da 
h a 
ae : (awa 5) 
0 
e=-9(W,3), 


Da 
eS —¢c-l9 |W, —— 
max c a( aay) on [p,q], 


and y is a C? non-negative function on [p,q] such that y, = y, = Oand 


tp 
i et (1 — Lya?) dt = 0. 
ty 


Suppose now there is some subdivision t, < t, < t, < ... < t, such that 
the tangent vector 0/ét is continuous on each segment [t,,¢,,,]. If 
a segment [t,,¢;,,] is not a null geodesic curve, it can be varied to give 
a timelike curve between its endpoints. Thus one has only to show 
that one can obtain a timelike curve from a non-spacelike curve y(t) 
made up of null geodesic segments whose tangent vectors are not 
parallel at points of discontinuity y(¢,). The parameter ¢ can be taken 
to be an affine parameter on each segment [¢,, ¢;,,]. The discontinuity 
[2/ét]|,, will be a spacelike vector, as it is the difference between two 
non-parallel null vectors in the same half of the null cone. Thus one 
can find a C? vector field W along [¢,_,, t;,,] such that g(W, @/ét) < Oon 
[t,_1,¢,] and g(W, é/ét) > 0 on [t,,t,,,]. Then a timelike curve between 
y(t;_,) and y(t,,,) will be obtained from the variation with variation 
vector field Z =xW, where x = c“(t,,, —#,) ((—#,_,) for t,_, <t < t,, 
and « = c—1(t;—t,_,) (ts4,—¢) fort, < t < t,,,, wherec = —g(W, 0/at). O 
Thus if y(t) is not a geodesic curve, it can be varied to give a timelike 
curve. If it is a geodesic curve, the parameter ¢ may be taken to be an 
affine parameter. One then sees that a necessary, but not sufficient, 
condition for a variation to yield a timelike curve is that the variation 
vector @/éu should be orthogonal to the tangent vector 0/ét everywhere 
on y(é), since otherwise (0/0t) g(@/éu, 6/ét) would be positive somewhere 
on y(t). For such a variation the first derivative (0/0w) 9(@/ét, 0/ct) will 
be zero and so one will have to examine the second derivative. 
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We shall therefore consider a two-parameter variation a of a null 
geodesic y(t) from q to p. The variation @ will be defined as before 
except that, for the reason given above, we shall restrict ourselves to 
variations whose variation vectors 


0 
Pi; 
are orthogonal to the tangent vector @/dé¢ on y(t). 
It is not convenient to study the behaviour of L under such a varia- 
tion since (—g(@/ét, 2/ét))? is not differentiable when g(0/dt, d/ét) = 0. 
Instead we shall consider the variation in: 


n—-1 tira GF 
A=—'S | aap a) a 


# 


u,=0 and Bie. w=0 
U2.=0 U2 |u.=0 


Clearly a necessary but not sufficient condition that a variation a of 
y(t) should yield a timelike curve from gq to p is that A should become 
positive. 

One has 


L o Cigc\\ wes 7) ;)) 7) @ Dee 

2 Ou, Ou, I\ ot ot  OUy Ot 9 du,’ ot CUy id du,’ ot et 
_ aa a @ (D2 é 
~ Baty GE \9 \Gauy? Gt) — 9 \ Bray? \ G2 Dury, 


oe @ 
Sh 


and so 
1 @A 0 (Reo Oa \a 
ee ee Te 
2 Bi, uno {o (= ee OU, (5. i) 5) i 


er lb) <3 
Tr (e \jangeiall . 
— = E = a) 


This formula is very similar to that for the variation of the length of 
a timelike curve. It can be seen that the variation of A is zero for a 
variation vector proportional to the tangent vector 0/é since 0/ét is 
null and R(d/ét, 0/ét)(2/ét) = 0 as the Riemann tensor is anti- 
symmetric. Such a variation would be equivalent to simply repara- 
metrizing y(t). Thus if one wants a variation which will give a timelike 
curve one need consider only the projection of the variation vector into 
the space S, at each point q of y(t). In other words, introducing a 
pseudo-orthonormal basis E,, E,, Eg, E, along y(t) with E, = 0/ét, the 
variation of A will depend only on the components Z” of the variation 
vector (m = 1, 2). 
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Proposition 4.5.11 

If there is no point in [q, p] conjugate to g along y(é) then CEN da Alpes 
will be negative for any variation « of y(t) whose variation vector 
0/0u|,,-9 is orthogonal to the tangent vector @/ét on y(t) and is not every- 
where zero or proportional to é/ét. In other words, if there is no point 
in [q,p] conjugate to qg then there is no small variation of y(t) which 
gives a timelike curve from q to p. 


The proof is similar to that for proposition 4.5.8, using instead the 
2x 2 matrix A,,,, of §4.2. 0 


Proposition 4.5.12 


If there is a point r in (¢, p) conjugate to q along y(¢) then there will be 
a variation of y(t) which will give a timelike curve from q to p. 


The proof is a bit finicky since one has to show that the tangent vector 
becomes timelike everywhere. Let W™ be the components in the space 
S (see §4.2) of the Jacobi field which vanishes at g and r. It obeys 

d2 

df We ae 
where for convenience ¢ has been taken to be an affine parameter. 
Since W™” will be at least C? and since dW™/dé is not zero at q and r, 
one can write W™ = fIV™ where Wis a unit vector and f and Ware C”. 
Then a 
Githf =0, 


a Ble A Ss 
where ay ie We Roan OW. 


Let xe€[r, p] be such that Wis not zero in [7, z]. Let h, be the minimum 
value of h in [r,xz]. Let a>0 be such that a?+h,> 0 and let 
b = {—f(e“—1)-}],. Then the field 


zm = (et —1) +f} 0™ 
will vanish at q and x and will satisfy 


d? ; 
dm ("+ Bens") = 0 in (¢,2). 


We shall choose a variation a(u,t) of y(t) from g to x such that the 
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components in S of its variation vector 6/éu|,-» equals Z” and such 


7 & Bu? at 
satisfies 


Dee ee a3) 
a ae uno) \Ow at du 


where t, is the value of t at x, and € > 0 but less than the least value of 
Z™ (2Z"/dt?+ RranaZ”) in the range }t, < t < #,. Then by (4.49) 
(22/0u?) g(0/et, @/et) will be negative everywhere in [g,x] and so for 
sufficiently small wu, a will give a timelike curve from q to x. If one joins 
this curve to the section of y from z to p, one will obtain a non-spacelike 
curve from g to p which is not a null geodesic curve. Thus there will 
be a variation of this curve which gives a timelike curve from gto p. O 


u=0 


—e for 0<i< hi, 
={e(t—#t,) for Ht, <t <i, 


=0 
. e(t,-t) for 3t,<t<t,, 


By similar methods one can prove: 


Proposition 4.5.13 


If y(é) is a null geodesic curve orthogonal to a spacelike two-surface Y 
from S to p and if there is no point in [.Y, p] conjugate to FY along y, 
then no small variation of y can give a timelike curve fronrY to p .O 


Proposition 4.5.14 
If there is a point in (/,p) conjugate to ¥ along p, then there is a 
variation of y which gives a timelike curve from ¥ to p. 0 


These results on variations of timelike and non-spacelike curves will 
be used in chapter 8 to show the non-existence of longest geodesics. 


5 


Exact solutions 


Any space-time metric can in a sense be regarded as satisfying 
Einstein’s field equations 


Ra sa Rg ste AQas = onl, (5.1) 


(where we use the units of chapter 3), because, having determined the 
left-hand side of (5.1) from the metric tensor of the space-time 
(.@,8), one can define T,,, as the right-hand side of (5.1). The matter 
tensor so defined will in general have unreasonable physical properties; 
the solution will be reasonable only if the matter content is reasonable. 

We shall mean by an exact solution of Einstein’s equations, a space-— 
time (4,8) in which the field equations are satisfied with Ti, the 
energy-momentum tensor of some specified form of matter which 
obeys postulate (a) (‘local causality’) of chapter 3, and one of the 
energy conditions of §4.3. In particular, one may look for exact 
solutions for empty space (7, = 0), for an electromagnetic field (Ti, 
has the form (3.7)), for a perfect fluid (T,, has the form (3.8)), or for 
a space containing an electromagnetic field and a perfect fluid. 
Because of the complexity of the field equations, one cannot find 
exact solutions except in spaces of rather high symmetry. Exact 
solutions are also idealized in that any region of space-time is likely to 
contain many forms of matter, while one can obtain exact solutions 
only for rather simple matter content. Nevertheless, exact solutions 
give an idea of the qualitative features that can arise in General 
Relativity, and so of possible properties of realistic solutions of the field 
equations. The examples we give will show many types of behaviour 
which will be of interest in later chapters. We shall discuss solutions 
with particular reference to their global properties. Many of these 
global properties have only recently been discovered, although the 
solutions have been known in a local form for some time. 

In §5.1 and § 5.2 we consider the simplest Lorentz metrics: those of 
constant curvature. The spatially isotropic and homogeneous cosmo- 
logical models are described in §5.3, and their simplest anisotropic 

g iL alee] HLS 
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generalizations are discussed in § 5.4. It is shown that all such simple 
models will have a singular origin provided that A does not take large 
positive values. The spherically symmetric metrics which describe 
the field outside a massive charged or neutral body are examined in 
§5.5, and the axially symmetric metrics describing the field outside 
a special class of massive rotating bodies are described in §5.6. It is 
shown that some of the apparent singularities are simply due to a bad 
choice of coordinates. In § 5.7 we describe the Godel universe and in 
§ 5.8 the Taub-NUT solutions. These probably do not represent the 
actual universe but they are of interest because of their pathological 
global properties. Finally some other exact solutions of interest are 
mentioned in § 5.9. 


5.1 Minkowski space-time 

Minkowski space-time (.4,y) is the simplest empty space-time in 
General Relativity, and is in fact the space-time of Special Relativity. 
Mathematically, it is the manifold R* with a flat Lorentz metric y. 
In terms of the natural coordinates (x1, x?, x3, x4) on R4, the metric y, 
can be expressed in the form 


ds? = — (da4)? + (dar)? + (dx)? + (dx3)?. (5.2) 


If one uses spherical polar coordinates {t,7,0,¢) where af = i, 
x? = rcos0, x2 =rsinécos¢, x! = rsinOsing, the metric takes the 


oem ds? = — df? + dr? +72 (d6?-+sin26 dd). (5.3) 


This metric is apparently singular for r = 0 and sin@ = 0; however 
this is because the coordinates used are not admissible coordinates at 
these points. To obtain regular coordinate neighbourhoods one has to 
restrict the coordinates, e.g. to the ranges 0< r<oa, 0<6<z7, 
0 < ¢ < 27. One needs two such coordinate neighbourhoods to cover 
the whole of Minkowski space. 

An alternative coordinate system is given by choosing advanced 
and retarded null coordinates 7, w defined by v=t+r, w=t—r 
(> v > w). The metric becomes 


ds? = —dudw-+ }(v—w)? (d6?+ sin? 6 d¢?), (5.4) 


where —00 <v <0, —0 <w< oo. The absence in the metric of 
terms in dv®, dw? corresponds to the fact that the surfaces {w = con- 
stant}, {v = constant} are null (ie. w,,w.,g% =0= V.q¥.,9); see 
figure 12. _ 
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t w = constant 


constant 


v = constant 


(i) (ii) 
FicurE 12. Minkowski space. The null coordinate v(w) may be thought of as 
incoming (outgoing) spherical waves travelling at the speed of light; they are 
advanced (retarded) time coordinates. The intersection of a surface 
{v = constant} with a surface {w = constant} is a two-sphere. 
(i) The v, w coordinate surfaces (one coordinate is suppressed). 
(11) The (¢, 7) plane; each point represents a two-sphere of radius r. 


In a coordinate system in which the metric takes the form (5.2), the 
geodesics have the form x%(v) = b%v+c*% where b® and c“ are constants. 
Thus the exponential map exp,: T, >-@ is given by 

a7 (exp, X) = X*+a7(p), 
where X@ are the components of X with respect to the coordinate basis 
{6/éx%} of T,,. Since exp is one-one and onto, it is a diffeomorphism 
between 7, and .@. Thus any two points of 4 can be joined by a 
unique geodesic curve. As exp is defined everywhere on T, for all p, 
(.&,n) is geodesically complete. 

For a spacelike three-surface /, the future (past) Cauchy develop- 
ment D+(.S) (D-(F)) is defined as the set of all points ¢gé-W such that 
each past-directed (future-directed) inextendible non-spacelike curve 
through q intersects F, cf. §6.5. If D+(H) U DS) = M, ie. if every 
inextendible non-spacelike curve in -@ intersects /, then / is said 
to be a Cauchy surface. In Minkowski space-time, the surfaces 
{a4 = constant} are a family of Cauchy surfaces which cover the whole 
of .@. One can however find inextendible spacelike surfaces which are 
not Cauchy surfaces; for example the surfaces 


S, {— (x4)? + (wl)? + (a?)? + (x8)? = o = constant}, 
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where o < 0,24 < 0, are spacelike surfaces which lie entirely inside the 
past null cone of the origin O, and so are not Cauchy surfaces (see 
figure 13). In fact the future Cauchy development of F, is the region 
bounded by &, and the past light cone of the origin. By lemma 4.5.2, 
the timelike geodesics through the origin O are orthogonal to the 
surfaces Y. If reDt(%)UD(L&%) then the timelike geodesic 


through r and O is the longest timelike curve between 7 and Y,. If 


Null 
geodesic 


Future null 

rong on Uniformly 
accelerating 
timelike 


curve 


Surface 
~— {x* = constant} 


Past null Po 
cone of O 


Fieure 13. A Cauchy surface {rt = constant} in Minkowski space-time, and 
spacelike surfaces Y,, %,. which are not Cauchy surfaces. The normal geodesics 
to the surfaces Y,, Y,, all intersect at O. 


however r does not lie in D+(.X%,) U D-(X) there is no longest timelike 
curve between 7 and %: either r lies in the region o > 0, in which case 
there is no timelike geodesic through r orthogonal to &, or r lies in 
the region a < 0, x4 > 0, in which case there is a timelike geodesic 
through r orthogonal to “ but this geodesic is not the longest curve 
between r and Y as it contains a conjugate point to S Nat OE. 
figure 13). 

To study the structure of infinity in Minkowski space-time, we shall 
use the interesting representation of this space-time given by Penrose. 
From the null coordinates v, w, we define new null coordinates in 
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which the infinities of v, « have been transformed to finite values; 
thus we define p, g by tanp =v, tang = w where —}7 < p < 4, 
— $7 <q < 3m (and p > q). Then the metric of (4,n) takes the form 


ds? = sec? psec? q(— dp dq + isin? (p—q) (dé? + sin? 6 d¢?)). 
The physical metric y is therefore conformal to the metric & given by 
ds* = — 4dp dq + sin? (p—q) (d@?+ sin? 6 d¢?). (5.5) 
This metric can be reduced to a more usual form by defining 
=pt+q, r=p—q, 
where —am<t't+r <n, -n<t'-r <7, r>0; (5.6) 
(5.5) is then 
ds? = — (dt’)? + (dr’)? + sin? r’(d6? + sin? 6. d¢?). (5.7) 
Thus the whole of Minkowski space-time is given by the region (5.6) of 
oe ds* = fsec? (4(t’ +7’)) sec? (A(¢’ —1’)) dé? 


where ds? is determined by (5.7); the coordinates ¢, r of (5.3) are 
related to t’, r’ by 


2t = tan ($(t’ +7’)) + tan (4(¢’ —7’)), 
2r = tan (H(t’ +r’) —tan (f(¢’ —7’)). 


Now the metric (5.7) is locally identical to that of the Einstein static 
universe (see § 5.3), which is a completely homogeneous space-time. 
One can analytically extend (5.7) to the whole of the Einstein static 
universe, that is one can extend the coordinates to cover the manifold 
Rix S? where —« < t’ < © and 7’, 0, ¢ are regarded as coordinates 
on S3 (with coordinate singularities at r’ = 0,r' = 7and@0=0,0=7 
similar to the coordinate singularities in (5.3); these singularities can 
be removed by transforming to other local coordinates in a neighbour- 
hood of points where (5.7) is singular). On suppressing two dimensions, 
one can represent the Einstein static universe as the cylinder 
x2+y? = 1 imbedded in a three-dimensional Minkowski space with 
metric ds? = — dé? + dx? + dy? (the full Einstein static universe can be 
imbedded as the cylinder x?+y?+z?+w? = 1 in a five-dimensional 
Euclidean space with metric ds? = —dé?+dz?+dy?+dz?+dw?, cf. 
Robertson (1933)). 

One therefore has the situation: the whole of Minkowski space-time 
is conformal to the region (5.6) of the Einstein static universe, that is, 
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to the shaded area in figure 14. The boundary of this region may there- 
fore be thought of as representing the conformal structure of infinity 
of Minkowski space-time. It consists of the null surfaces p = 37 
(labelled 4+) and q = — 42 (labelled 4—) together with points p = 37, 
q = 4m (labelled i+), p = 47, ¢ =—47 (labelled 1°) and p= — 2m 
q = —4n (labelled i-). Any future-directed timelike geodesic in 


i slip sseant 


¥icuRE 14. The Einstein static universe represented by an imbedded cylinder; 
the coordinates 0, ¢ have been suppressed. Each point represents one half 
of a two-sphere of area 47sin?r’. The shaded region is conformal to the whole of 
Minkowski space-time; its boundary (part of the null cones of i+, 7° and ¢-)may 
be regarded as the conformal infinity of Minkowski space-time. 


Minkowski space approaches i+ (i-) for indefinitely large positive 
(negative) values of its affine parameter, so one can regard any time- 
like geodesic as originating at i- and finishing at 7+ (ef. figure 15(7)). 
Similarly one can regard null geodesics as originating at %- and ending 
at £*, while spacelike geodesics both originate and end at 7°. Thus one 
may regard 7* and ¢~ as representing future and past timelike infinity, 
J* and ¥~- as representing future and past null infinity, and 7° as 
representing spacelike infinity. (However non-geodesic curves do not 
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obey these rules; e.g. non-geodesic timelike curves may start on .4~- 
and end on %+.) Since any Cauchy surface intersects all timelike and 
null geodesics, it is clear that it will appear as a cross-section of the 
space everywhere reaching the boundary at 2°. 
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(i) (ii) 
Figure 15 
(1) The shaded region of figure 14, with only one coordinate suppressed, 
representing Minkowski space-time and its conformal infinity. 
(11) The Penrose diagram of Minkowski space-time; each point represents 
a two-sphere, except for i+,7° and i-, each of which is a single point, and points 
on the line r = 0 (where the polar coordinates are singular). 


One can also represent the conformal structure of infinity by 
drawing a diagram of the (¢’,r’) plane, see figure 15 (ii). As in figure 
12 (ii), each point of this diagram represents a sphere S?, and radial 
null geodesics are represented by straight lines at + 45°. In fact, the 
structure of infinity in any spherically symmetric space-time can be 
represented by a diagram of this sort, which we shall call a Penrose 
diagram. On such diagrams we shall represent infinity by single lines, 
the origin of polar coordinates by dotted lines, and irremovable singu- 
larities of the metric by double lines. 

The conformal structure of Minkowski space we have described is 
what one would regard as the ‘normal’ behaviour of a space-time at 
infinity; we shall encounter different types of behaviour in later 
sections. 

Finally, we mention that one can obtain spaces locally identical to 
(.4,n) but with different (large scale) topological properties by identi- 
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fying points in W which are equivalent under a discrete isometry 
without a fixed point (e.g. identifying the point (x1, x, x°, x4) with the 
point (z!, x2, x3,24+c), where c is a constant, changes the topological 
structure from R? to R3 x S!, and introduces closed timelike lines into 
the space-time). Clearly, (4, y) is the universal covering space for 
all such derived spaces, which have been studied in detail by Auslander 
and Markus (1958). 


5.2 De Sitter and anti-de Sitter space-times 


The space-time metrics of constant curvature are locally characterized 
by the condition Ryjeg = Hs R(GacIsa—GJaaJoc)- Lhis equation is equiva- 
lent to Ciy.g = 0 = Ry —tFo,,; thus the Riemann tensor is determined 
by the Ricci scalar # alone. It follows at once from the contracted 
Bianchi identities that F is constant throughout space-time; in fact 
these space-times are homogeneous. The Einstein tensor is 


Rap = tRG ap eas thas. 


One can therefore regard these spaces as solutions of the field 
equations for an empty space with A = iR, or for a perfect fluid with 
a constant density R/327 and a constant pressure — R/32x. However 
the latter choice does not seem reasonable, as in this case one cannot 
have both the density and the pressure positive; in addition, the 
equation of motion (3.10) is indeterminate for such a fluid. 

The space of constant curvature with R = 0 is Minkowski space- 
time. The space for R > 0 is de Sitter space-time, which has the 
topology &? x S? (see Schrodinger (1956) for an interesting account of 
this space). It is easiest visualized as the hyperboloid 


—v? + w?+ 224 y2 4 22 = a? 
in flat five-dimensional space R> with metric 
— dv? + dw? + dz? + dy?+ dz? = ds? 


(see figure 16). One can introduce coordinates (f, y, 8, @) on the hyper- 
boloid by the relations 


asinh (a-%) =v, acosh(a—t) cosy = w, 
a cosh (alt) sin y cos 6 = 2, ‘cosh (a—4t) sin y sin @ cos¢ = y, 


a cosh («~1t) sin y sin Asin d = z. 
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FIGURE 16. De Sitter space-time represented by a hyperboloid imbedded in 
a five-dimensional flat space (two dimensions are suppressed in the figure). 
(1) Coordinates (t, vy, 9, ) cover the whole hyperboloid; the sections {¢ = con- 
stant} are surfaces of curvature k = +1. 
(ii) Coordinates (¢,2,9,2) cover half the hyperboloid; the surfaces 
{f= constant} are flat three-spaces, their geodesic normals diverging from a 
point in the infinite past. 


In these coordinates, the metric has the form 
ds? = — dé? +a?. cosh? (a4) . {dy? + sin? y(d6? + sin? 6 d¢?)}. 


The singularities in the metric at y = 0, y = 7 and at 6 = 0, 6=17, 
are simply those that occur with polar coordinates. Apart from these 
trivial singularities, the coordinates cover the whole space for 
—0o<t<0,0<y<7,0<0<7,0< ¢< 27. The spatial sections 
of constant ¢ are spheres S* of constant positive curvature and are 
Cauchy surfaces. Their geodesic normals are lines which contract 
monotonically to a minimum spatial separation and then re-expand 
to infinity (see figure 16 (i)). 
One can also introduce coordinates 


wty ax j= ay A az 


t = alog a” ~ we’ w+’ ~ WED 


on the hyperboloid. In these coordinates, the metric takes the form 


ds? = — dé?+exp (2a) (d2?+ dj? + d2?). 
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However these coordinates cover only half the hyperboloid as # is not 
defined for w+v < 0 (see figure 16 (ii)). 

The region of de Sitter space for which v+w > 0 forms the space— 
time for the steady state model of the universe proposed by Bondi and 
Gold (1948) and Hoyle (1948). In this model, the matter is supposed 
to move along the geodesic normals to the surfaces {¢ = constant}. As 
the matter moves further apart, it is assumed that more matter is 
continuously created to maintain the density at a constant value. 
Bondi and Gold did not seek to provide field equations for this model, 
but Pirani (1955), and Hoyle and Narlikar (1964) have pointed out 
that the metric can be considered as a solution of the Einstein equa- 
tions (with A = 0) if in addition to the ordinary matter one introduces 
a scalar field of negative energy density. This ‘C’-field would also be 
responsible for the continual creation of matter. 

The steady state theory has the advantage of making simple and 
definite predictions. However from our point of view there are two 
unsatisfactory features. The first is the existence of negative energy, 
which was discussed in § 4.3. The other is the fact that the space-time 
is extendible, being only half of de Sitter space. Despite these aesthetic 
objections, the real test of the steady state theory is whether its pre- 
dictions agree with observations or not. At the moment it seems that 
they do not, though the observations are not yet quite conclusive. 

de Sitter space is geodesically complete; however, there are points 
in the space which cannot be joined to each other by any geodesic. 
This is in contrast to spaces with a positive definite metric, when 
geodesic completeness guarantees that any two points of a space can 
be joined by at least one geodesic. The half of de Sitter space which 
represents the steady state universe is not complete in the past (there 
are geodesics which are complete in the full space, and cross the 
boundary of the steady state region; they are therefore incomplete in 
that region). 

To study infinity in de Sitter space-time, we define a time coordinate 
t’ by t’ = 2arctan (expat) — 47, 
where —47<t' < dn. (5.8) 
Then ds? = a? cosh? (a—l#’) . ds?, 


where ds? is given by (5.7) on identifying 7’ = y. Thus the de Sitter 
space is conformal to that part of the Einstein static universe defined 
by (5.8) (see figure 17 (i)). The Penrose diagram of de Sitter space is 
accordingly as in figure 17 (ii). One half of this figure gives the Penrose 
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(i) De Sitter space-time is conformal to the region —47 <1?’ < 3m of the 
Einstein static universe. The steady state universe is conformal to the shaded 
region. 

(ii) The Penrose diagram of de Sitter space-time. 

(iii) The Penrose diagram of the steady state universe. 
In (11), (11) each point represents a two-sphere of area 27 sin? vy; null lines are 
at 45°. y = 0 and y = 7 are identified. 


diagram of the half of de Sitter space-time which constitutes the 
steady state universe (figure 17 (iii)). 

One sees that de Sitter space has, in contrast to Minkowski space, 
a spacelike infinity for timelike and null lines, both in the future and 
the past. This difference corresponds to the existence in de Sitter 
space-time of both particle and event horizons for geodesic families 
of observers. 

In de Sitter space, consider a family of particles whose histories are 
timelike geodesics; these must originate at the spacelike infinity %— 
and end at the spacelike infinity .%+. Let p be some event on the world- 
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(i) The particle horizon defined by a congruence of geodesic curves when 
past null infinity .%- is spacelike. 
(ii) Lack of such a horizon if .¥- is null. 


line of a particle O in this family, i.e. some time in its history (proper 
time measured along O's world-line). The past null cone of p is the set 
of events in space-time which can be observed by O at that time. The 
world-lines of some other particles may intersect this null cone; these 
particles are visible to O. However, there can exist particles whose 
world-lines do not intersect this null cone, and so are not yet visible 
to O. At a later time O can observe more particles, but there still exist 
particles not visible to O at that time. We say that the division of 
particles into those seen by O at p and those not seen by O at 9, is the 
particle horizon for the observer O at the event p; it represents the 
history of those particles lying at the limits of O’s vision. Note that it 
is determined only when the world-lines of all the particles in the 
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family are known. If some particle lies on the horizon, then the event p 
is the event at which the particle's creation light cone intersects O’s 
world-line. In Minkowski space, on the other hand, all the other 
particles are visible at any event p on O’s world-line if they move on 
timelike geodesics. As long as one considers only families of geodesic 
observers, one may think of the existence of the particle horizon as a 
consequence of past null infinity being spacelike (see figure 18). 

All events outside the past null cone of p are events which are not, 
and never have been, observable by O up to the time represented by 
the event p. There is a limit to O's world-line on 4+. In de Sitter space— 
time, the past null cone of this point (obtained by a limiting process 
in the actual space-time, or directly from the conformal space-time) 
is a boundary between events which will at some time be observable 
by O, and those that will never be observable by O. We call this surface 
the future event horizon of the world-line. It is the boundary of the past 
of the world-line. In Minkowski space-time, on the other hand, the 
limiting null cone of any geodesic observer includes the whole of 
space-time, so there are no events which a geodesic observer will never 
be able to see. However if an observer moves with uniform acceleration 
his world-line may have a future event horizon. One may think of the 
existence of a future event horizon for a geodesic observer as being 
a consequence of .4+ being spacelike (see figure 19). 

Consider the event horizon for the observer O in de Sitter space-time 
and suppose that at some proper time (event p) on his world-line, his 
light cone intersects the world-line of the particle Q. Then Q is always 
visible to O at times after p. However there is on Q’s world-line an 
event r which lies on O’s future event horizon; O can never see later 
events on Q’s world-line than r. Moreover an infinite proper time 
elapses on O’s world-line from any given point till he observes r, but 
a finite proper time elapses along Q’s world-line from any given event 
to r, which is a perfectly ordinary event on his world-line. Thus O sees 
a finite part of Q’s history in an infinite time; expressed more physi- 
cally, as O observes Q he sees a redshift which approaches infinity as 
O observes points on Q’s world-line which approach r. Correspondingly, 
Q never sees beyond some point on 0’s world-line, and sees nearby 
points on O’s world-line only with a very large redshift. 

At any point on O’s world-line, the future null cone is the boundary 
of the set of events in space-time which O can influence at and after 
that time. To obtain the maximal set of events in space-time that O 
could at any time influence, we take the future light cone of the limit 
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is spacelike. 
(ii) If future infinity consists of a null %+ and 7°, there is no future event 


horizon for a geodesic observer O. However an accelerating observer R may 
have a future event horizon. 


point of O’s world-line on past infinity %-; that is, we take the 
boundary of the future of the world-line (which can be regarded as 
O’s creation light cone). This has a non-trivial existence for a geodesic 
observer only if the past infinity .7~ is spacelike (and is in fact then 
O’s past event horizon). It is clear from the above discussion that 
in the steady state universe, which has a null past infinity for timelike 
and null geodesics and a spacelike future infinity, any fundamental 
observer has a future event horizon but no past particle horizon. 

One can obtain other spaces which are locally equivalent to the de 
Sitter space, by identifying points in de Sitter space. The simplest such 
identification is to identify antipodal points p, p’ (see figure 16) on the 
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hyperboloid. The resulting space isnot time orientable; if time increases 
in the direction of the arrow at p, the antipodal identification implies 
it must increase in the direction of the arrow at p’, but one cannot 
continuously extend this identification of future and past half null 
cones over the whole hyperboloid. Calabi and Markus (1962) have 
studied in detail the spaces resulting from such identifications; they 
show in particular that an arbitrary point in the resulting space can 
be joined to any other point by a geodesic if and only if it is not time 
orientable. 

The space of constant curvature with F < 0 is called anti-de Sitter 
space. It has the topology S!x R%, and can be represented as the 
hyperboloid ye a ee 
in the flat five-dimensional space R® with metric 

ds? = — (du)? — (dv)? + (dx)? + (dy)? + (dz)?. 


There are closed timelike lines in this space; however it is not simply 
connected, and if one unwraps the circle S1 (to obtain its covering 
space R1) one obtains the universal covering space of anti-de Sitter 
space which does not contain any closed timelike lines. This has the 
topology of R*. We shall in future mean by ‘anti-de Sitter space’, this 
universal covering space. 

It can be represented by the metric 


ds? = — dé? + cos*t {dy? + sinh? y(d6? + sin? 6 d¢*)}. (5.9) 


This coordinate system covers only part of the space, and has apparent 
singularities at t = +47. The whole space can be covered by coordi- 
nates {t’,7,0, 6} for which the metric has the static form 


ds? = — cosh? r dt’? + dr? + sinh? 7(d6? + sin? 6 d¢?). 


In this form, the space is covered by the surfaces {¢’ = constant} which 
have non-geodesic normals. 
To study the structure at infinity, define the coordinate 7’ by 


r’ = 2arctan(expr)—47, O<71' < jn. 


Then one finds ds? = cosh?r ds?, where d? is given by (5.7); that is, 
the whole of anti-de Sitter space is conformal to the region 0 <r <n 
of the Einstein static cylinder. The Penrose diagram is shown in 
figure 20; null and spacelike infinity can be thought of as a timelike 
surface in this case. This surface has the topology R! x S?. 


132 EXACT SOLUTIONS [5.2 


Lines 
es constant} 


SS] 
* 
ie) 
Vt 


x 
(ofa 


re 
= jn IN rel Surfaces 
ws tt (’ = constant} 
ST a | t= te) 
Tan P IT Surfaces 
76 Ri es > {¢ = constant 
aly > 
ge: “nese ((=—x} 
ig past 44 Lines 
gear {y = constant} 
(0 Te a LAT ia Null geodesics 
a. yj N a7 Ta from infinity to 7 
s BS 


of 


Timelike ~ 
geodesics 
from p 


Null 
geodesic 
from p 


Coordinate 


e.- 


(ii) 
FIGURE 20 

(i) Universal anti-de Sitter space is conformal to one half of the Einstein 
static universe. While coordinates (t’, 7, 0, @) cover the whole space. coordinates 
(t, vy, 9, ¢) cover only one diamond-shaped region as shown. The geodesics 
orthogonal to the surfaces {f = constant} all converge at p and q, and then 
diverge out into similar diamond-shaped regions. 

(ii) The Penrose diagram of universal anti-de Sitter space. Infinity consists of 
the timelike surface ¥ and the disjoint points 7+, i~. The projection of some 
timelike and null geodesics is shown. 
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One cannot find a conformal transformation which makes timelike 
infinity finite without pinching off the Einstein static universe to a 
point (if a conformal transformation makes the time coordinate finite 
it also scales the space sections by an infinite factor), so we represent 
timelike infinity by the disjoint points i+, i~. 

The lines {y,@,¢ constant} are the geodesics orthogonal to the 
surfaces {¢ = constant}; they all converge to points q (respectively, p) 
in the future (respectively, past) of the surface, and this convergence 
is the reason for the apparent (coordinate) singularities in the original 
metric form. The region covered by these coordinates is the region 
between the surface ¢ = 0 and the null surfaces on which these normals 
become degenerate. 

The space has two further interesting properties. First, as a con- 
sequence of the timelike infinity, there exists no Cauchy surface 
whatever in the space. While one can find families of spacelike 
surfaces (such as the surfaces {t’ = constant}) which cover the space 
completely, each surface being a complete cross-section of the space— 
time, one can find null geodesics which never intersect any given 
surface in the family. Given initial data on any such surface, one 
cannot predict beyond the Cauchy development of the surface; thus 
from the surface {f = 0}, one can predict only in the region covered by 
the coordinates t, y, 6, 6. Any attempt to predict beyond this region is 
prevented by fresh information coming in from the timelike infinity. 

Secondly, corresponding to the fact that the geodesic normals from 
t = 0 all converge at p and gq, all the past timelike geodesics from p 
expand out (normal to the surfaces {ft = constant}) and reconverge 
at qg. In fact, all the timelike geodesics from any point in this space 
(to either the past or future) reconverge to an image point, diverging 
again from this image point to refocus at a second image point, and 
so on. The future timelike geodesics from p therefore never reach Y, in 
contrast to the future null geodesics which go to ¥ from p and form the 
boundary of the future of p. This separation of timelike and null 
geodesics results in the existence of regions in the future of p (i.e. which 
can be reached from p by a future-directed timelike line) which cannot 
be reached from p by any geodesic. The set of points which can be 
reached by future-directed timelike lines from p is the set of points 
lying beyond the future null cone of p; the set of points which can be 
reached from p by future-directed timelike geodesics is the interior of 
the infinite chain of diamond-shaped regions similar to that covered 
by coordinates (¢,7,9,¢). One notes that all points in the Cauchy 
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development of the surface ¢ = 0 can be reached from this surface by 
a unique geodesic normal to this surface, but that a general point 
outside this Cauchy development cannot be reached by any geodesic 
normal to the surface. 


5.3. Robertson—Walker spaces 


So far, we have not considered the relation of exact solutions to the 
physical universe. Following Einstein, we can ask: can one find space— 
times which are exact solutions for some suitable form of matter and 
which give a good representation of the large scale properties of the 
observable universe? If so, we can claim to have a reasonable ‘cosmo- 
logical model’ or model of the physical universe. 

However we are not able to make cosmological models without some 
admixture of ideology. In the earliest cosmologies, man placed himself 
in a commanding position at the centre of the universe. Since the time 
of Copernicus we have been steadily demoted to a medium sized planet 
going round a medium sized star on the outer edge of a fairly average 
galaxy, which is itself simply one of a local group of galaxies. Indeed 
we are now so democratic that we would not claim that our position in 
space is specially distinguished in any way. We shall, following Bondi 
(1960), call this assumption the Copernican principle. . 

A reasonable interpretation of this somewhat vague principle is to 
understand it as implying that, when viewed on a suitable scale, the 
universe is approximately spatially homogeneous. 

By spatially homogeneous, we mean there is a group of isometries 
which acts freely on -@, and whose surfaces of transitivity are space- 
like three-surfaces; in other words, any point on one of these surfaces 
is equivalent to any other point on the same surface. Of course, the 
universe is not exactly spatially homogeneous; there are local irregu- 
larities, such as stars and galaxies. Nevertheless it might seem reason- 
able to suppose that the universe is spatially homogeneous on a large 
enough scale. 

While one can build mathematical models fulfilling this requirement 
of homogeneity (see next section), it is difficult to test homogeneity 
directly by observation, as there is no simple way of measuring the 
separation between us and distant objects. This difficulty is eased by 
the fact that we can, in principle, fairly easily observe isotropies in 
extragalactic observations (i.e. we can see if these observations are the 
same in different directions, or not), and isotropies are closely con- 
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nected with homogeneity. Those observational investigations of iso- 
tropy which have been carried out so far support the conclusion that 
the universe is approximately spherically symmetric about us. 

In particular, it has been shown that extragalactic radio sources are 
distributed approximately isotropically, and that the recently ob- 
served microwave background radiation, where it has been examined, 
is very highly isotropic (see chapter 10 for further discussion). 

It is possible to write down and examine the metrics of all space— 
times which are spherically symmetric; particular examples are the 
Schwarzschild and Reissner—Nordstrém solutions (see § 5.5); however 
these are asymptotically flat spaces. In general, there can exist at most 
two points in a spherically symmetric space from which the space looks 
spherically symmetric. While these may serve as models of space-time 
near a massive body, they can only be models of the universe consistent 
with the isotropy of our observations if we are located near a very 
special position. The exceptional cases are those in which the universe 
is isotropic about every point in space time; so we shall interpret the 
Copernican principle as stating that the universe is approximately 
spherically symmetric about every point (since it is approximately 
spherically symmetric about us). 

As has been shown by Walker (1944), exact spherical symmetry 
about every point would imply that the universe is spatially homo- 
geneous and admits a six-parameter group of isometries whose surfaces 
of transitivity are spacelike three-surfaces of constant curvature. Such 
a space is called a Robertson—Walker (or Friedmann) space (Minkowski 
space, de Sitter space and anti-de Sitter space are all special cases of 
the general Robertson—Walker spaces). Our conclusion, then, is that 
these spaces are a good approximation to the large scale geometry of 
space-time in the region that we can observe. 

In the Robertson—Walker spaces, one can choose coordinates so that 
the metric has the form 

ds? = — dé? + S?(t) do?, 
where do? is the metric of a three-space of constant curvature and is 
independent of time. The geometry of these three-spaces is qualita- 
tively different according to whether they are three-spaces of constant 
positive, negative or zero curvature; by rescaling the function S, one 
can normalize this curvature K to be + 1 or — 1 in the first two cases. 
Then the metric do? can be written 


do? = dy? +f%(y) (d6?+sin?0 dg’), 
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where sny if K=-+1, 


f(x) =%x re 0) 
sinhy if K=-—1. 


The coordinate y runs from 0 to co if K = 0 or —1, but runs from 0 to 
27 if K = +1. When K = 0 or —1, the three-spaces are diffeomorphic 
to R3 and so are ‘infinite’, but when K = + 1 they are diffeomorphic 
to a three-sphere S* and so are compact (‘closed’ or ‘finite’). One could 
identify suitable points in these three-spaces to obtain other global 
topologies; it is even possible to do this, in the case of negative or zero 
curvature, in such a way that the resulting three-space is compact 
(Lobell (1931)). However such a compact surface of constant negative 
curvature would have no continuous groups of isometries (Yano and 
Bochner (1953))—although Killing vectors exist at each point, they 
would not determine any global Killing vector fields and the local 
groups of isometries they generate would not link up to form global 
groups. In the case of zero curvature, a compact space could only have 
a three-parameter group of isometries. In neither case would the 
resulting space-time be isotropic. We shall not make such identifica- 
tions, as our original reason for considering these spaces was that they 
were isotropic (and so had a six-parameter group of isometries). In 
fact the only identifications which would not result in an anisotropic 
space would be to identify antipodal points on S? in the case of constant 
positive curvature. 

The symmetry of the Robertson—Walker solutions requires that the 
energy-momentum tensor has the form of a perfect fluid whose 
density and pressure p are functions of the time coordinate t only, 
and whose flow lines are the curves (y, 0, 6) constant (so the coordinates 
are comoving coordinates). This fluid can be thought of as a smoothed 
out approximation to the matter in the universe; then the function 
S(t) represents the separation of neighbouring flow lines, that is, of 
‘nearby’ galaxies. 


The equation of conservation of energy (3.9) in these spaces takes 
the form 


t= —3(u+p)S'/S. (5.10) 
The Raychaudhuri equation (4.26) takes the form 
4n(u+3p)—A = —38""/8. Gai) 


The remaining field equation (which is essentially (2.35)) can be written 


39°? = 87r(wS?)/S + AS2— 3K. (5.12) 
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Whenever S” + 0, (5.12) can in fact be derived, with an arbitrary value 
of the constant K, as a first integral of (5.10), (5.11); so the real effect 
of this field equation is to identify the integration constant as the 
curvature of the metric do? of the three-spaces {t = constant}. 

It is reasonable to assume (cf. the energy conditions, § 4.3) that is 
positive and p is non-negative. (In fact, present estimates are 
10-*% gm cm~* > ply > 10-4 gmem-3, wy > py > 0). Then, if A is zero, 
(5.11) shows that S cannot be constant; in other words the field equa- 
tions then imply the universe is either expanding or contracting. 
Observations of other galaxies show, as first found by Slipher and 
Hubble, that they are moving away from us, and so indicate that the 
matter in the universe is expanding at the present time. Current 
observations give the value of S'/S at the present time as 


H =(S'/S)|, ~ 10-year“, 


believed correct to within a factor 2. From this, (5.11) shows that if 
A is zero, S must have been zero a finite time ¢, ago (that is, a time ¢, 
measured along the world-line of our galaxy) where 


t, < H ~& 10” years. 


From (5.10) it follows that the density decreases as the universe 
expands, and conversely that the density was higher in the past, 
increasing without bound as S+0. This is therefore not merely a 
coordinate singularity (as for example, in anti-de Sitter universe 
expressed in coordinates (5.9)); the fact that the density isinfinite there 
shows that some scalar defined by the curvature tensor is also infinite. 
It is this that makes the singularity so much worse than in the corre- 
sponding Newtonian situation; in both cases the world-lines of all the 
particles intersect in a point and the density becomes infinite, but here 
space-time itself becomes singular at the point S = 0. We must there- 
fore exclude this point from the space-time manifold, as no known 
physical laws could be valid there. 

This singularity is the most striking feature of the Robertson— 
Walker solutions. It occurs in all models in which 4 + 3p is positive 
and A is negative, zero, or with not too large a positive value. It would 
imply that the universe (or at least that part of which we can have any 
physical knowledge) had a beginning a finite time ago. However this 
result has here been deduced from the assumptions of exact spatial 
homogeneity and spherical symmetry. While these may be reasonable 
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approximations on a large enough scale at the present time, they 
certainly do not hold locally. One might think that, as one traced the 
evolution of the universe back in time, the local irregularities would 
grow and could prevent the occurrence of a singularity, causing the 
universe to ‘bounce’ instead. Whether this could happen, and whether 
physically realistic solutions with inhomogeneities would contain 
singularities, is a central question of cosmology and constitutes the 
principal problem dealt with in this book; it will turn out that there is 
good evidence to believe that the physical universe does in fact become 
singular in the past. 

If some suitable relation between p and y is specified, (5.10) can be 
integrated to give wz as a function of S. In fact the pressure is very 
small at the present epoch. If one takes it and A to be zero, one finds 
from (5.10) 4m u 


pal’ = Ss?’ 
where JM is a constant, and (5.12) becomes 


3S"2-—6M/S =—3K = E/M. (5.13) 


The first equation expresses the conservation of mass when the pres- 
sure is zero, while the second (the Friedmann equation) isan energy 
conservation equation for a comoving volume of matter; the constant 
E represents the sum of the kinetic and potential energies. If E is 
negative (i.e. K is positive), S will increase to some maximum value 
and then decrease to zero; if E is positive or zero (i.e. K is negative or 
zero), S will increase indefinitely. 

The explicit solutions of (5.13) have a simple form if given in terms 
of a rescaled time parameter 7(t), defined by 


dr/dt = S-\(t); (5.14) 
they take the form 
S = (H/3)(coshr—1),  ¢ = (E/3) (sinh r—7), ee = Sale 
S77 fSAr, if K=0; 


S = (— E/3) (1—cos7), t=(—H/3)(r-sin7), if K=1. 
(The case K = 0 is the Einstein-de Sitter universe; clearly S oc #8.) 


If p is non-zero but positive, the qualitative behaviour is the same. 
In particular if p = (y— 1) wherey isa constant, 1 < y < 2, one finds 
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37 = M/S*7, and the solution of (5.12) near the singularity takes the 


form 
S oc (2/87, 


If A is negative, the solution expands from an initial singularity, 
reaches a maximum and then recollapses to a second singularity. If 
A is positive, then for K = 0 or —1 the solution expands forever and 
asymptotically approaches the steady state model. For K = + 1 there 
are several possibilities. If A is greater than some value A,r 
(Agit =(— £/311)3/(3.2)? if p = 0) the solution will start from an 
initial singularity and will expand forever asymptotically approaching 
the steady state model. If A = A,,,, there is a static solution, the 
Einstein static universe. (The metric form (5.7) is that of the particular 
Einstein static solution for which «+p = (47)-1, A = 1+ 87.) There 
is also a solution which starts from an initial singularity and asympto- 
tically approaches the Einstein universe, and one which starts from the 
Einstein universe in the infinite past and expands forever. If A < Agr, 
there are two solutions—one expands from an initial singularity and 
then recollapses to a second singularity; the other contracts from an 
infinite radius in the infinite past, reaches a minimum radius, and then 
re-expands. This and the universe asymptotic to the static universe 
in the infinite past are the only solutions which could represent the 
observed universe and which do not have a singularity. In these 
models, S"’ is always positive, and this seems to be in conflict with 
observations of redshifts of distant galaxies (Sandage (1961, 1968)). 
Also, the maximum density in these models would not have been very 
much larger than the present density. This would make it difficult to 
understand phenomena such as the microwave background radiation 
and the cosmic abundance of helium, which seem to point to a very 
hot dense phase in the history of the universe. 

Just as in the previous cases we have studied, one can find conformal 
mappings of the Robertson—Walker spaces into the Einstein static 
space. We use the coordinate 7 defined by (5.14) as a time coordinate; 
then the metric takes the form 


ds? = S%(r){—dr?-+ dy? +f2(x) (d0?+sin?0dg2)}. (5.15) 


In the case K = +1, this is already conformal to the Einstein static 
space (put 7 = ¢’, y = 7’ to agree with the notation of (5.7)). Thus these 
spaces are mapped into precisely that part of the Einstein static space 
determined by the values taken by 7. When p = A = 0, 7 lies in the 
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range 0 < 7 < 7, so the whole space is mapped into this region in the 
Einstein static universe while its boundary is mapped into the three- 
spheres 7 = 0, 7 = 7. (If p > 0, it is mapped into a region for which 
7 takes values 0 < 7 < a < 7, for some number a.) In the case K = 0, 
the same coordinates represent the space as conformal to flat space 
(see (5.15)), so on using the conformal transformations of §5.1, one 
obtains these spaces mapped into some part of the diamond repre- 
senting Minkowski space-time in the Einstein static universe (see 
figure 14); the actual region is again determined by the values taken 
by 7. When A = 0, 0 < 7 < ©, so this space (which is the Einstein— 
de Sitter space when p = 0) is conformal to the half v’ > 0 of the 
diamond which represents Minkowski space-time. In the case K = — 1, 
one obtains the metric conformal to part of the region of the Einstein 
static space for which $7 2>t’+7r’ > —4m, 47 > t'—-r’' > —4, on 
defining 
t’ = arctan (tanh $(7+ y)) + arc tan (tanh 3(r—)), 


r’ = arctan (tanh 4(7+ y)) — arctan (tanh 3(7— y)). 


The part of this diamond-shaped region covered depends on the range 
of 7; when A = 0, the space is mapped into the upper half. 

One thus obtains these spaces and their boundaries conformal to 
some (generally finite) region of the Einstein static space, see figure 
21 (i). However there is an important difference from the previous 
cases: part of the boundary is not ‘infinity’ in the sense it was previ- 
ously, but represents the singularity when S = 0. (The conformal 
factor can be thought of as making infinity finite by giving an infinite 
compression, but making the singular point S = 0 finite by an infinite 
expansion.) In fact this makes little difference to the conformal dia- 
grams; one can give the Penrose diagrams as before (see figures 21 (ii) 
and 21 (ili)). In each case when p > 0 the singularity at t = 0 is repre- 
sented by a spacelike surface; this corresponds to the existence of 
particle horizons (defined precisely as in § 5.2) in these spaces. Also 
when K = + 1 the future boundary is spacelike, implying the existence 
of event horizons for the fundamental observers; when K = 0 or —1 
and A = 0, future infinity is null and there are no future event horizons 
for the fundamental observers in these spaces. 

At this stage, one should examine the following question: anti- 
de Sitter space could be expressed'in the Robertson—Walker form (5.9) 
and then expressed conformally as part of the Einstein static universe. 
When one did so, one found that the Robertson—Walker coordinates 


5.3] ROBERTSON-WALKER SPACES 141 


{f = constant} = 0) 
World-lines of yes 
fund tal 
a See = aS 
(coordinate baa SSS jo 


singularity) %~(7 = 0) ~_s ali J-(t = 0) 
(coordinate (coordinate 
singularity) singularity) 
(ii) (iii) 
FIcurE 21 


(1) The Robertson—Walker spaces (p = A = 0) are conformal to the regions 
of the Einstein static universe shown, in the three cases K = +1, 0 and —1. 

(i) Penrose diagram of a Robertson—Walker space with K =+1 and 
ae Naa). 

(iii) Penrose diagram of a Robertson—Walker space with K = 0 or —1 and 
pi Nee 


covered only a small part of the full space-time. That is to say, the 
space-time described by the Robertson—Walker coordinates could be 
extended. One should therefore show that the Robertson—Walker 
universes in which there is matter are in fact inextendible. This 
follows because one can show that if u > 0, p > 0 and X is any vector 
at any point q, the geodesic y(v) through g = (0) in the direction of X 
is such that either 
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(i) y(v) can be extended to arbitrary positive values of v, or 
(ii) there is some vy > 0 such that the scalar invariant 
(Ry; — 49,3) (RY — 3 Rg") = (4+ AP +3(p— A)? 
is unbounded on y([0, %)). 

It is now clear that the surfaces {t = constant} are Cauchy surfaces 
in these spaces. Further one sees that the singularity is universal in the 
following sense: all timelike and null geodesics through any point in 
the space approach it for some finite value of their affine parameter. 


5.4 Spatially homogeneous cosmological models 


We have seen that there are singularities in any Robertson—Walker 
space-time in which yw > 0, p > 0 and A is not too large. However one 
could not conclude from this that there would be singularities in 
more realistic world models which allow for the fact that the universe 
is not homogeneous and isotropic. In fact, one does not expect to find 
that the universe can be very accurately described by any attainable 
exact solution. However one can find exact solutions, less restricted 
than the Robertson—Walker solutions, which may be reasonable 
models of the universe, and see if singularities occur in them or not: 
the fact that singularities do occur in such models gives an indication 
that the existence of singularities may be a general property of all 
space-times which can be regarded as reasonable models of the 
universe. 

A simple class of such solutions are those in which the requirement 
of isotropy is dropped but the requirement of spatial homogeneity (the 
strict Copernican principle) is retained (although the universe seems 
approximately isotropic at the present time, there might have been 
large anisotropies at an earlier epoch). Thus in these models one 
assumes there exists a group of isometries G, whose orbits in some part 
of the model are spacelike hypersurfaces. (The orbit of a point p under 
the group G, is the set of points into which p is moved by the action of 
all elements of the group.) These models may be constructed locally by 
well-known methods; see Heckmann and Schiicking (1962) for the 
case r= 3, and Kantowski and Sachs (1967) for the case r = 4 (if 
r > 4, the space-time is necessarily a Robertson—Walker space). 

The simplest spatially homogeneous space—times are those in which 
the group of isometries is Abelian; the group is then of type I in the 
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classification given by Bianchi (1918), so we call these Bianchi I spaces. 
We discuss Bianchi I spaces in some detail, and then give a theorem 
showing singularities will occur in all non-empty spatially homogene- 
ous models in which the timelike convergence condition (§4.3) is 
satisfied. 

Suppose the spatially homogeneous space-time has an Abelian 
isometry group; for simplicity we assume A = 0 and that the matter 
content is a pressure-free perfect fluid (‘dust’). Then there exist 
comoving coordinates (é, x,y,z) such that the metric takes the form 


ds? = — di? + X?(f) dz? + Y2(£) dy? + Z(t) dz?. (5.16) 
Defining the function S(t) by S* = X YZ, the conservation equations 
show that the density of matter is given by 47 = M/S, where M is 
a suitably chosen constant. The general solution of the field equations 
can be written 


X= S(t/8)28 «, Y = S(t8/S)2s eran, 


T= S(t8/S)25™ (a+$7) 
where S is given by S3 = 2Mt(t+ ); 


x (> 0)isa constant determining the magnitude of the anisotropy (we 
exclude the isotropic case (X = 0), which is the Einstein—de Sitter 
universe (§ 5.3)), and a(—47 < a < 37) is a constant determining the 
direction in which the nfost rapid expansion takes place. The average 
rate of expansion is given by 


Si 2t+2/2, 


S 2a 


the expansion in the z-direction is 


X° 2t+Z(1+2sina)/2 
3 t+ ; 
and the expansions Y’/Y, Z'/Z in the y, z directions are given by 
similar expressions in which a is replaced by « + 37, a + 7 respectively. 
The solution expands from a highly anisotropic singular state at 
f = 0, reaching a nearly isotropic phase for large ¢ when it is nearly the 
same as the Einstein—de Sitter universe. The average length S increases 
monotonically as ¢ increases, its initial high rate of change (S oc i for 
small t) decreasing steadily (S oc ¢3 for large ¢). Thus the universe 
evolves more rapidly, at early times, than its isotropic equivalent. 
Suppose one considers the time-reverse of the model, and follows 
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this forward in time towards the singularity. The initially almost 
isotropic contraction will become very anisotropic at late times. For 
general values of a, i.e. @ + 47, the term 1 + 2sin (a + $7) will be nega- 
tive. Thus the collapse in the z-direction would halt, and, for suffi- 
ciently early times, be replaced by an expansion, the rate of expansion 
becoming indefinitely large for early enough times. In the x- and 
y-directions, on the other hand, the collapse would continue mono- 
tonically towards the singularity. Thus if one considers the forward 
direction of time in the original model, one has a ‘cigar’ singularity: 
matter collapses in along the z-axis from infinity, halts, and then 
starts re-expanding, while in the x- and y-directions the matter 
expands monotonically at all times. If one could receive signals from 
early enough times in such a model, one would see a maximum red- 
shift in the z-direction, at earlier times matter in this direction being 
observed with progressively smaller redshifts and then with in- 
definitely increasing blue-shifts. 

The behaviour in the exceptional case a = 47 is rather different. In 
this case, the terms 1 + 2sin (@ + 37) and 1+ 2sin (« + 477) both vanish. 
Thus the expansions in the axis directions are 

eet | 2 ee 


0c eae eA 


t+>° < 


If one follows the time-reversed model, the rate of collapse in the 
y- and z-directions slows asymptotically down to zero, while the rate 
of collapse in the x-direction increases indefinitely. In the original 
model, one hasa ‘pancake’ singularity: matter expands monotonically 
in all directions, starting from an indefinitely high expansion rate in 
the x-direction but from zero expansion rates in the y- and z-directions. 
Indefinitely high redshifts would be seen in the x-direction, but there 
would be limiting redshifts in the y- and <-directions. 

Further examination shows that in the general (‘ cigar’) case, there 
is a particle horizon in every direction despite the anisotropic expan- 
sion. However in the exceptional (‘pancake’) case, no horizon occurs 
in the x-direction; in fact the particles that can be seen by an observer 
at the origin at time ¢) are characterized by coordinate values (ZG) 
lying within the infinite cylinder 


ey? < p? 


2 (/9M + 9) 
where De maltese (+2) = ts) 
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While we have here considered these models for vanishing pressure 
and A term only, properties of these spaces with more realistic matter 
contents can easily be obtained; for example if one has either a perfect 
fluid with p = (y—1), y a constant (1 < y < 2), or a mixture of 
a photon gas and matter with pressure p < 4, the behaviour near the 
singularity is the same as in the dust case. 

An interesting consequence of the non-existence of a particle 
horizon in the x-direction in the exceptional (‘pancake’) case, is that 
one can extend the solution continuously across the singularity. We 
shall show this explicitly in the case of the dust solution. 

The metric takes the form (5.16) where now 


X(t) = ¢(@Mi¢+=X))4, FY) = Zi) = (BMi(t+d))§. (5.17) 


We now choose new coordinates 7, 7 which satisfy the equations 


4 ft dt 
= | | ee eee ee 
tanh (27/9.722) = y/7, exp (sz|,x0) 7? — 9. 
One then finds that the space with metric (5.16), (5.17) is given in the 
new coordinates by 


ds? = A(t) (—dr?+dy?) + B(t) (dy? + dz?) (5.18) 
where 


Ait) = exp(-=).gare+2)4 Bit) = (2M(t+ Z))3, (5.19) 


the whole space (for ¢ > 0) being mapped into the region VY defined by 
7 > 0,7?—7? > 0. The function é(7, 7) is now defined implicitly as the 
solution of the equation 


= 7 = gMtexp—“ 4") (5.20) 


for whicht > 0. The (7,7) plane is given in conformally flat coordinates. 
The region V in this plane, bounded by the surface t = 0, is shown in 
figure 22. In this diagram, the world-lines of the particles are straight 
lines diverging from the origin. 

The functions A(t), B(t) are continuous as > 0 from above. One can 
therefore extend the solution continuously to the whole (7,7) plane 
by specifying that (5.19) holds everywhere, (5.20) holds inside Y, and 


iit t(r,) = 0 


holds outside Y. Then (5.18) is a C° metric which is a solution of the 
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Ficure 22. Dust-filled Bianchi I space with a pancake singularity. 
(i) The (7,7) plane; null lines are at + 45°. 


[5.4 


(ii) A half-section of the space in (7, 7, y) coordinates (the z-coordinate is 
suppressed), showing the past light cone of the point P = (T,, 0, 0). There is a 


particle horizon in the y-direction but not in the x- (i.e. 7) direction. 


field equations equivalent to (5.16), (5.17) inside ¥, and is a flat 
space-time outside ¥°. However the solution is not C! across the 
boundary of ¥, and in fact the density of matter becomes infinite on 
this boundary (as S-+>0 there). Since the first derivatives are not 
square integrable, the Einstein field equations cannot be interpreted 
on the boundary even in a distributional sense (see § 8.4). While the 
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extension onto the boundary is unique, it is in no way unique beyond 
the boundary. We have carried out the extension in the case of dust; 

a similar extension could be carried out if one had a mixture of matter 
and radiation. 

Let us now return to considering general non- -empty spatially homo- 
geneous models. The existence of a singularity in these models will 
follow directly from Raychaudhuri’s equation if the motion of the 
matter is geodesic and without rotation (as must be the case, for 
example, if the world-lines are orthogonal to the surfaces of homo- 
geneity) and the timelike convergence condition is satisfied; however 
there exist such spaces in which the matter accelerates and rotates, 
and either of these factors could possibly prevent the existence of a 
singularity. The following result, which is an improved version of a 
theorem of Hawking and Ellis (1965), shows that in fact neither 
acceleration nor rotation can prevent the existence of singularities in 
these models. 


Theorem 
(4, 9) cannot be timelike geodesically complete if: 

(1) R,,A°K° > 0 for all timelike and null vectors K (this is true 
if the energy-momentum tensor is type I (§4.3) and n+p, > 0, 
ee ee > 0); 


(2 ) there exist ee of motion for the matter fields such that 
the Cauchy problem has a unique solution (see chapter 7); 

(3) the Cauchy data on some spacelike three-surface # is invariant 
under a group of diffeomorphisms of # which is transitive on #. 


Since the intrinsic geometry of # is invariant under a transitive 
group of diffeomorphisms, these are isometries and # is complete, 
i.e. cannot have any boundary. It can be shown (see § 6.5) that if there 
is anon-spacelike curve which intersects # more than once, then there 
exists a covering manifold AM of M inwhich each connected component 
of the image of # will not intersect any non-spacelike curve more than 
once. We shall assume that ./@ is timelike geodesically complete, and 
show that this is inconsistent with conditions (1), (2) and (3). 

Let # bea connected component of the image of # in Me. By (3), 
the Cauchy data on H is homogeneous. Therefore by condition (2), 
the Cauchy development of any region of Hi is isometric to the Cauchy 
development of any other similar region of HK. This implies that the 
surfaces {s = constant} are homogeneous if they lie within the Cauchy 
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development of H, where s is the distance from # measured along 
the geodesic normals to H#. These surfaces must lie either entirely 
within or entirely outside the Cauchy development of #, as otherwise 
there would be equivalent regions in H# which had inequivalent 
Cauchy evolutions. The surfaces {s = constant} will lie in the Cauchy 
development of H a long as they remain spacelike, because the 
boundary of the Cauchy development of ce (if it exists) must be null 
($6.5). , 

The geodesics orthogonal to #% will be orthogonal to the surfaces 
{s = constant}, as a vector representing the separation of points equal 
distances along neighbouring geodesics will remain orthogonal to the 
geodesics if it is so initially. As in § 4.1, one can represent the spatial 
separation of neighbouring geodesics orthogonal to Ha by a matrix A 
which is the unit matrix on #. By homogeneity, it will be constant on 
the surfaces {s = constant} while these lie in the Cauchy development 
of #. While A is non-degenerate, the map from # to a surface 
{s = constant} defined by the normal geodesics will be of rank three 
and so the surfaces will be spacelike three-surfaces contained within 
the Cauchy development of KH. The expansion 


0 = (det A)-1d (det A)/ds 


of these geodesics obeys Raychaudhuri’s equation (4.26) with the 
vorticity and acceleration zero. By condition (1), R,, V*V® is positive 
for all timelike vectors V*. Thus 6 will become infinite and A will be 
degenerate for some finite positive or negative value s, of s. The map 
from # to the surface s = sy can have at most rank two; there will 
therefore be at least one vector field Z on # such that AZ = 0. The 
integral curves of this vector field are curves in .¥ which are mapped by 
the geodesic normals to one point in the surface s = 8. Thus this 
surface will be at most two-dimensional. As the geodesics lie in the 
Cauchy development of .¥ for |s| < |s9|, the surface s = Sq will lie in 
the Cauchy development or on the boundary of the Cauchy develop- 
ment of .#. By condition (1), the energy-momentum tensor has a 
unique timelike eigenvector at each point. These eigenvectors will 
form a C? timelike vector field whose integral curves may be thought 
of as representing the flow lines of the matter. As the surface s = So lies 
in the Cauchy development of .#’ or on its boundary, all the flow lines 
that pass through it must intersect .7’. But then as ¥ is homogeneous, 
all the flow lines that pass through H# must pass through s = s,. Thus 
the flow lines define a diffeomorphism between # and the surface 
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s e ° . ” . . . . 
8 = 8). This is impossible, as ¥ is three-dimensional and s = So 18 
two-dimensional. 0 


In fact, if all the flow lines were to pass through a two-dimensional 
surface, one would expect the matter density to become infinite. We 
have now seen that a large scale rotation or acceleration cannot, by 
itself, prevent the occurrence of singularities in a universe model 
obeying the strict Copernican principle. In later theorems we shall see 
that irregularities are in general also unable to prevent the occurrence 
of singularities in world models. 


5.5 The Schwarzschild and Reissner-Nordstrém solutions 


While the spatially homogeneous solutions may be good models for the 
large scale distribution of matter in the universe, they are inadequate 
for describing, for example, the local geometry of space-time in the 
solar system. One can describe this geometry to a good approximation 
by the Schwarzschild solution, which represents the spherically sym- 
metric empty space-time outside a spherically symmetric massive 
body. In fact, all the experiments which have so far been carried out 
to test the difference between the General Theory of Relativity and 
Newtonian theory are based on predictions by this solution. 
The metric can be given in the form 


ds? = — (1 = =) dt? + (1 -*) “dr? +7?(d6?+ sin?4d¢2), (5.21) 
where 7 > 2m. It can be seen that this space-time is static, i.e. 0/ét is 
a timelike Killing vector which is a gradient, and is sphcrically sym- 
metric, i.e. is invariant under the group of isometries SO(3) operating 
on the spacelike two-spheres {t,r constant} (cf. appendix B). The 
coordinate r in this metric form is intrinsically defined by the require- 
ment that 47? is the area of these surfaces of transitivity. The solution 
is asymptotically flat as the metric has the form Jab = Nop + O(A/r) for 
large r. Comparison with Newtonian theory (cf. $3.4) shows that m 
should be regarded as the gravitational mass, as measured from 
infinity, of the body producing the field. It should be emphasized that 
this solution is unique: if any solution of the vacuum field equations 
is spherically symmetric, it is locally isometric to the Schwarzschild 
solution (although it may of course look totally different if it is given 
in some other coordinate system; see appendix B and Bergmann, 
Cahen and Komar (1965)). 


6 HLS 
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Normally one would regard the Schwarzschild metric for r greater 
than some value r, > 2m as being the solution outside some spherical 
body, the metric inside the body (r < 7») having a different form 
determined by the energy-momentum tensor of the matter in the 
body. However it is interesting to see what happens when the metric 
is regarded as an empty space solution for all values of r. 

The metric is then singular when r = 0 and when r = 2m (there are 
also the trivial singularities of polar coordinates when 6 = 0 and 
6 = 7). One must therefore cut r = 0 and r = 2m out of the manifold 
defined by the coordinates (¢,7, 6, ¢), since in § 3.1 we took space-time 
to be represented by a manifold with a Lorentz metric. Cutting out the 
surface r = 2m divides the manifold into two disconnected components 
for which 0 < r < 2m and 2m < r < o. Since we took the space-time 
manifold to be connected, we must consider only one of these com- 
ponents and the obvious one to choose is the one for r > 2m, which 
represents the external field. One must then ask whether this manifold 
& with the Schwarzschild metric ¢ is extendible, i.e. whether there 
is a larger manifold @ into which .@ can be imbedded and a suitably 
differentiable Lorentz metric §’ on 4’ which coincides with g on the 
image of .@. The obvious place where 4 might be extended is where 
r tends to 2m. A calculation shows that although the metric is singular 
at r = 2m in the Schwarzschild coordinates (t,7,0,¢), no scalar poly- 
nomials of the curvature tensor and the metric diverge as r > 2m. This 
suggests that the singularity at r = 2m is not areal physical singularity, 
but rather one which is a result of a bad choice of coordinates. 

To confirm this, and to show that (.#, $) can be extended, define 


eh dr 
= =r = r+ 2mlog (r—2m). 


Then v=et+r* 
is an advanced null coordinate, and 
w=t—r* 


is a retarded null coordinate. Using coordinates (v, 7,4, ¢) the metric 
takes the Eddington—Finkelstein form g’ given by 
ze 2m 
ds? = — (1 =) dv? + 2dv dr + r2(d6? + sin? 6 dd?). (5.22) 
The manifold -@ is the region 2m < r < 0, but the metric (5.22) is 
non-singular and indeed analytic on the larger manifold W for which 


5.5] THE SCHWARZSCHILD SOLUTION 151 


0<r< oo. The region of (.#’, $’) for which 0 < r < 2m is in fact 
isometric to the region of the Schwarzschild metric for which 
0<r< 2m. Thus by using different coordinates, ie. by taking a 
different manifold, we have extended the Schwarzschild metric so that 
it is no longer singular at 7 = 2m. In the manifold -# the surface 
r = 2m is a null surface, as can be seen from the Finkelstein diagram 
(figure 23). This is a section (0,4 constant) of the space-time; each 
point represents a two-sphere of area 47r?. Some null cones and radial 
null geodesics are indicated on this diagram. Surfaces {t = constant} 
are indicated; one sees that ¢ becomes infinite on the surface r = 2m. 

This representation of the Schwarzschild solution has the odd 
feature that it is not time symmetric. One might expect this from the 
cross term (dv dr) in (5.22); it is qualitatively clear from the Finkelstein 
diagram. The most obvious asymmetry is that the surface r = 2m acts 
as a one-way membrane, letting future-directed timelike and null 
curves cross only from the outside (7 > 2mm) to the inside (r < 2m). Any 
past-directed timelike or null curve in the outside region cannot cross 
into the inside region. No past-directed timelike or null curve within 
7 = 2m ean approach r = 0. However any future-directed timelike or 
null curve which crosses the surface r = 2m approaches r = 0 within 
a finite affine distance. As r—> 0, the scalar R?°¢R,,,, diverges as m?/r6, 
Therefore r = 0 is a real singularity; the pair (-4’, 8’) cannot be 
extended in a C? manner or in fact even in a C° manner across 7 = 0. 

If one uses the coordinate w instead of v, the metric takes the form 
s” given by 

dase (1 -=") dw? — 2 dw dr +r2(d62+sin? 0 d¢2). 


This is analytic on the manifold -@” defined by the coordinates 
(w,r, 0,6) for 0<r< oo. Again the manifold -@ is the region 
2m <r < coand the new region 0 < r < 2m is isometric to the region 
0 <r < 2m of the Schwarzschild metric, but the isometry reverses 
the direction of time. In the manifold -@”, the surface r = 2m is again 
a null surface which acts as a one-way membrane. However this time 
it acts in the other direction of time, letting only past-directed time- 
like or null curves cross from the outside (r > 2m) to the inside 
(r < 2m). 

One can in fact make both extensions (.4’, 8’) and (-#”, §”) simul- 
taneously; that is to say, there is a still larger manifold M* with 
metric &* into which both (4, 8’) and (.4@”, 8”) can be isometrically 
imbedded, so that they coincide on the region r > 2m which is 
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Ficure 23. Section (@,¢) constant of the Schwarzschild solution. 

(i) Apparent singularity at r = 2m when coordinates (é,r) are used. 

(1) Finkelstein diagram obtained by using coordinates (v, 7) (lines at 45° are 
lines of constant v). Surface 7 = 2m is a null surface on which t = ©. 
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isometric to (.#, §). A construction of this larger manifold has been 
given by Kruskal (1960). To obtain it, consider (-@, 8) in the coordi- 
nates (v, w, 0, ); then the metric takes the form 


| 


2 
1 -=) dv dw +r2(d62 + sin? 0 d¢2), 
where 7 is determined by 
3(v—w) = r+ 2m log (r— 2m). 


This presents the two-space (6,¢ constant) in null conformally flat 
coordinates, as the space with metric ds? = — dv dw is flat. The most 
general coordinate transformation which leaves this two-space 
expressed in such conformally flat double null coordinates is v’ = v’(v), 
w’ = w'(w) where v’ and w’ are arbitrary C1 functions. The resulting 
metric is 


2m\ dv dw 
Sa a ee | ay! ¢ 2 2 in2 2 
dso (1 5 4 ; dv’ dw’ + r?(d6? + sin? 6 d¢?). 


To reduce this to a form corresponding to that obtained earlier for 
Minkowski space-time, define 
v=h(v'-w’), t= h(v'+w’). 
The metric takes the final form 
ds? = F(t’, x’) (—dt’*+dzx"?) + 7r2(t’, x’) (dé? +sin?6d¢?). (5.23) 


The choice of the functions v’, w’ determines the precise form of the 


metric. Kruskal’s choice was v’ = exp(v/4m), w’ = —exp(—w/4m). 
Then 7 is determined implicitly by the equation 
(t’)? — (a’)® = —(r— 2m) exp (r/2m) (5.24) 
and F is given by 
F? = exp(—1/2m) .16m?/r. (5.25) 


On the manifold .@* defined by the coordinates (é’,x’,0,¢) for 
(t’)2— (x’)? < 2m, the functions r and F’ (defined by (5.24), (5.25)) are 
positive and analytic. Defining the metric g* by (5.23), the region I of 
(.@*, $*) defined by 2’ > |t’| is isometric to (4,8), the region of the 
Schwarzschild solution for which r > 2m. The region defined by 
a’ > —t' (regions I and II in figure 24) is isometric to the advanced 
Finkelstein extension (.4’, 8’). Similarly the region defined by 2’ > t’ 
(regions I and II’ in figure 24) is isometric to the retarded Finkelstein 
extension (.#”, $8”). There is also a region I’, defined by «’ < —|t’|, 
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Ficure 24. The maximal analytic Schwarzschild extension. The 6, 6 coordinates 
are suppressed; null lines are at + 45°. Surfaces {r = constant} are homogeneous. 

(i) The Kruskal diagram, showing asymptotically flat regions I and I’ and 
regions II, IT’ for which r < 2m. 


(ii) Penrose diagram, showing conformal infinity as well as the two 
singularities. 


which turns out to be again isometric with the exterior Schwarzschild 
solution (.4%, 8). This can be regarded as another asymptotically flat 
universe on the other side of the Schwarzschild ‘throat’. (Consider the 
section t = 0. The two-spheres {r = constant} behave as in Euclidean 
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space, for large 7; however for small 7, they have an area which 
decreases to the minimum value 1677m2and then increases again, asthe 
two spheres expand into the other asymptotically flat three-space.) 
The regions I’ and II are isometric with the advanced Finkelstein 
extension of region I’, and similarly I’ and II’ are isometric with the 
retarded Finkelstein extension of I’, as can be seen from figure 24. 
There are no timelike or null curves which go from region I to region I’. 
All future-directed timelike or null curves which cross the part of the 


surface r = 2m represented here by ¢’ = |z’| approach the singularity 
at t’ = (2m + (x’)?)?, where r = 0. Similarly past-directed timelike or 
null curves which cross t’ = —|2‘| approach another singularity at 


t’ = —(2m+ (x’)*)?, where again r = 0. 

The Kruskal extension (.@*, 8*) is the unique analytic and locally 
inextendible extension of the Schwarzschild solution. One can con- 
struct the Penrose diagram of the Kruskal extension by defining new 
advanced and retarded null coordinates 


v” = arctan (v'(2m)-4), w” = arctan (w’(2m)-#) 
for —am<v"+w"<m and —}1<v" < hn, —in<w" <4a 


(see figure 24 (ii)). This may be compared with the Penrose diagram 
for Minkowski space (figure 15 (ii)). One now has future, past and null 
infinities for each of the asymptotically flat regions I and I’. Unlike 
Minkowski space, the conformal metric is continuous but not differ- 
entiable at the points 2°. 

If we consider the future light cone of any point outside r = 2m, 
the radial outwards geodesic reaches infinity but the inwards one 
reaches the future singularity; if the point lies inside r = 2m, both these 
geodesics hit the singularity, and the entire future of the point is ended 
by the singularity. Thus the singularity may be avoided by any 
particle outsider = 2m (so it isnot ‘universal’ as it isin the Robertson— 
Walker spaces), but once a particle has fallen inside r = 2m (in region 
IT) it cannot evade the singularity. This fact will turn out to be closely 
related to the following property: each point inside region II represents 
a two-sphere that is a closed trapped surface. This means the following: 
consider any two-sphere p (represented by a point in figure 24) and 
two two-spheres q, s formed by photons emitted radially outwards, 
inwards at one instant from p. The area of g (which is given by 47rr?) 
will be greater than the area of p, but the area of s will be less than the 
area of p, if all three lie in a region r > 2m. However if they all lie in 
the region II where r < 2m, then the areas of both q and s will be less 
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than the area of p (in the figure, r decreases as one moves from the 
bottom to the top of region II). In that case, we say that p is a closed 
trapped surface. Each point inside region II’ represents a time- 
reversed closed trapped surface (the existence of trapped surfaces is 
a necessary consequence of the fact that the surfaces r = constant are 
spacelike), and correspondingly all particles in region II’ must have 
come from the singularity in the past. We shall see in chapter 8 that 
the existence of the singularities is closely related to the existence of 
the closed trapped surfaces. 
The Reissner—Nordstr6ém solution represents the space-time outside 
a spherically symmetric charged body carrying an electric charge (but 
with no spin or magnetic dipole, so this is not a good representation of 
the field outside an electron). The energy-momentum tensor is there- 
fore that of the electromagnetic field in the space-time which results 
from the charge on the body. It is the unique spherically symmetric 
asymptotically flat solution of the Einstein—Maxwell equations and is 
locally rather similar to the Schwarzschild solution; there exist 
coordinates in which the metric has the form 


9 3 ye } 2\-—1 
‘dea (: = m+) di? + (1 ante dr? + r?(d6? + sin? 6.d¢?), 
ar = us 


~ (5.26) 


where m represents the gravitational mass and e the electric charge of 
the body. This asymptotically flat solution would normally be 
regarded as the solution outside the body only, the interior being 
filled in with some other suitable metric; but it is again interesting to 
see what happens if we regard it as a solution for all r. 

If e? > m? the metric is non-singular everywhere except for the 
irremovable singularity at r = 0; this may be thought of as the point 
charge which produces the field. If e2 < m?, the metric also has singu- 
larities at r, and r_, where r, = m+ (m?—e®)}; it is regular in the 
regions defined by co > r >1r,,r, >r>r_andr_>r> 0 (if e = m2, 
only the first and third regions exist). As in the Schwarzschild case, 
these singularities may be removed by introducing suitable coordinates 
and extending the manifold to obtain a maximal analytic extension 
(Graves and Brill (1960), Carter (1966)). The major differences that 
arise are due to the existence of two zeros in the factor in front of dé? 
rather than one as in the Schwarzschild case. In particular this implies 
that the first and third regions are both static, whereas the second 
region (when it exists) is spatially homogeneous but is not static. 
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To obtain the maximally extended manifold, we proceed in steps 
analogous to those in the Schwarzschild case. Defining the coordinate 
7 Dy 


then for r > r,, 


2 2 


Z r 
r® = 7+ — + __ log (r—r,) -——>—_log (r—r 2 2 
(r,—r_) g ( +) (r,—17_) g ( na if e-<m 
9 
r* = r4+-mlog ((r—m)*)—— eo: 
g (( eee if 2 = m2, 
: ; 2 2 , 777, 
r* = r+mlog (r?— 2mr +e?) +5 saretan ( if e? > m2, 


Defining advanced and retarded coordinates v, w by 
vot+r*, w=t—r* 
the metric (5.26) takes the double null form 


2m e? 
2 


ds? = — (: 45) dv dw +r? (d6? + sin? 6 d¢?). (5.27) 


In the case e? < m?, define new coordinates v”, w” by 
ie Se Vian =P 
vy = arctan (exp ( rrr »)) , w eraretan ( ex Gene w)) : 


Then the metric (5.27) takes the form 


2m _ e? r,4 
ds? = ( —— | 64 ——_, cosec 2v” cosec 2w” dv" dw” 
mata (r,—1r_) 


+7?(d62+sin?6d¢?), (5.28) 
where 7 is defined implicitly by 


tan v” tanw” = —exp (5 =) ') (r—r,)2(r—r_)-#? 
ae 


and a = (r,)~*(r_)?. The maximal extension is obtained by taking 
(5.28) as the metric §*, and .@* as the maximal manifold on which 
this metric is C?. 

The Penrose diagram of the maximal extension is shown in figure 25. 
There are an infinite number of asymptotically flat regions, where 
r >r,; these are denoted by I. These are connected by intermediate 
regions II and III where 7, >r>r_ and r_>r> 0 respectively. 
There is still an irremovable singularity at 7 = 0 in each region III, 
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Ficure 25. Penrose diagram for the maximally extended Reissner—Nordstrém 
solution (e? <m™?). An infinite chain of asymptotically flat regions I 
(0 >r>r,) are connected by regions II (7, >r>r_) and III (r_>r> 0); 


each region III is bounded by a timelike singularity at r = 0. 


but unlike in the Schwarzschild solution, it is timelike and so can be 
avoided by a future-directed timelike curve from a region I which 
crosses 7 = r,. Such a curve cam pass through regions II, IIT and II 
and re-emerge into another asymptotically flat region I. This raises 
the intriguing possibility that one might be able to travel to other 
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universes by passing through the ‘wormholes’ made by charges. 
Unfortunately it seems that one would not be able to get back again 
to our universe to report what one had seen on the other side. 

The metric (5.28) is analytic everywhere except at r = r_ where it is 
at least C*. One can define different coordinates v" and w” by 


vo” = arc ta aS 
an (exp E se ; 


Ee eee (- a (= w)) 


2nr_2 


where 7 is an integer > 2(r_,)?(r_)-®. In these coordinates, the metric 
is analytic everywhere except at r = r, where it is at least C?. The 
coordinates v” and w” are analytic functions of v” and w" for r + r i 
or 7_. Thus the manifold .@* can be covered by an analytic atlas, con- 
sisting of local coordinate neighbourhoods defined by coordinates v” 
and w" for r + r_and by local coordinate neighbourhoods defined by 
v” and w” for r +7,. The metric is analytic in this atlas. 

The case e? = m* can be extended similarly; the case e2 > m? is 
already inextendible in the original coordinates. The Penrose diagrams 
of these two cases are given in figure 26. 

In all these cases, the singularity is timelike. This means that, unlike 
in the Schwarzschild solution, timelike and null curves can always 
avoid hitting the singularities. In fact the singularities appear to be 
repulsive: no timelike geodesic hits them, though non-geodesic time- 
like curves and radial null geodesics can. The spaces are thus timelike 
(though not null) geodesically complete. The timelike character of the 
singularity also means that there are no Cauchy surfaces in these 
spaces: given any spacelike surface, one can find timelike or null curves 
which run into the singularity and do not cross the surface. For 
example in the case e? < m?, one can find a spacelike surface Y which 
crosses two asymptotically flat regions I (figure 25). This is a Cauchy 
surface for the two regions I and the two neighbouring regions II. 
However in the neighbouring regions III to the future there are past- 
directed inextendible timelike and null curves which approach the 
singularity and do not cross the surface r = r_. This surface is there- 
fore said to be the future Cauchy horizon for Y. The continuation of 
the solution beyond r = r_is not determined by the Cauchy data on /. 
The continuation we have given is the only locally inextendible 
analytic one, but there will be other non-analytic C” continuations 
which satisfy the Einstein—Maxwell equations. 
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Ficure 26. Penrose diagrams for the maximally extended Reissner—Nordstr6m 
lutions: : ss 
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In the first case there is an infinite chain or regions I (co > r > m) connected by 

regions III (m > r > 0). The points p are not part of the singularity at r = 0, but 

are really exceptional points at infinity. 
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A particle P crossing the surface r =r, would appear to have 
infinite redshift to an observer O whose world-line remains outside 
r = r, and approaches the future infinity i+ (figure 25). In the region II 
between r = r, andr = r_, the surfaces of constant r are spacelike and 
so each point of the figure represents a two-sphere which is a closed 
trapped surface. An observer P crossing the surface r = r_ would see 
the whole of the history of one of the asymptotically flat regions I in 
a finite time. Objects in this region would therefore appear to be 
infinitely blue-shifted as they approached i+. This suggests that the 
surface r = r_ would be unstable against small perturbations in the 
initial data on the spacelike surface , and that such perturbations 
would in general lead to singularities on r = r_. 


5.6 The Kerr solution 


In general, astronomical bodies are rotating and so one would not 
expect the solution outside them to be exactly spherically symmetric. 
The Kerr solutions are the only known family of exact solutions which 
could represent the stationary axisymmetric asymptotically flat field 
outside a rotating massive object. They will be the exterior solutions 
only for massive rotating bodies with a particular combination of 
multipole moments; bodies with different combinations of moments 
will have other exterior solutions. The Kerr solutions do however 
appear to be the only possible exterior solutions for black holes (see 
§9.2 and §9.3). 

The solutions can be given in Boyer and Lindquist coordinates 
(r,6, ¢,t) in which the metric takes the form 


2 
ds? p2 (= + 46?) + (r?+ a?) sin? 6d¢? — dt? + = (asin? 6 d¢ — dt)’, 
(5.29) 
where p%(r,0)=r?+a?cos?@ and A(r) =7?-2mr-+a’. 


m and a are constants, m representing the mass and ma the angular 
momentum as measured from infinity (Boyer and Price (1965)); when 
a = 0 the solution reduces to the Schwarzschild solution. This metric 
form is clearly invariant under simultaneous inversion of ¢ and @, 
i.e. under the transformation t>—t, ¢+—4¢, although it is not 
invariant under inversion of t alone (except when a = 0). This is what 
one would expect, since time inversion of a rotating object produces 
an object rotating in the opposite direction. 


162 EXACT SOLUTIONS [5.6 


When a? > m?, A > 0 and the above metric is singular only when 
r = 0. The singularity at r = 0 is not in fact a point but a ring, as can 
be seen by transforming to Kerr-Schild coordinates (x, y,z,¢), where 


xt+iy = (r+ia) sin Bexpi | (dg +a dr), 
z2=rcose, b= | (at-+ (2 4a) tar) — 


In these coordinates, the metric takes the form 
ds? = dx? + dy? + dz? — dé? 


2mr (r(andx+ydy)—a(xdy— yde) | zdz 


<< al) (5.30) 


where r is determined implicitly, up to a sign, in terms of 2, y, z by 
— (2? +9? 4-24 —a*) r*—a72* = 0. 


For r + 0, the surfaces {r = constant} are confocal alamei: in the 
(x,y,z) plane, which degenerate for r = 0 to the disc z?+ y? < a?,z = 0. 
The ring x? + y? = a*, z = 0 which is the boundary of this inven is a real 
curvature singularity as the scalar polynomial R,,.,R@ diverges 
there. However no scalar polynomial diverges on the disc except at 
the boundary ring. The function 7 can in fact be analy tically con- 
tinued from positive to negative values through the interior of the dise 
vty? <a?, z=0, to obtain a maximal analytic extension of the 
solution. 

To do this, one attaches another plane defined by coordinates 
(x’, y’,z') where a point on the top side of the disc z2+y? < a2, z = 0 
in the (x,y,z) plane is identified with a point with the same x and y 
coordinates on the bottom side of the corresponding dise in the 
(x',y’, 2’) plane. Similarly a point on the bottom side of the disc in the 
(x, y, 2) plane is identified with a point on the top side of the disc in the 
(x’, y’, 2’) plane (see figure 27). The metric (5.30) extends in the obvious 
way to this larger manifold. The metric on the (x’, y’, 2’) region is again 
of the form (5.29), but with negative rather than positive values of r. 
At large negative values of 7, the space is again asymptotically flat 
but this time with negative mass. For small negative values of r near 
the ring singularity, the vector @/@f is timelike, so the circles 
(t = constant, r = constant, 0 = constant) are closed timelike curves. 
These closed timelike curves can be deformed to pass through any 
point of the extended space (Carter (1968a)). This solution is geodesic- 


y = constant 


(7 > 0) 
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ally incomplete at the ring singularity. However the only timelike and 
null geodesics which reach this singularity are those in the equatorial 
plane on the positive r side (Carter (1968a)). 
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FIguRE 27. The maximal extension of the Kerr solution for a? > m? is obtained 
by identifying the top of the dise ~? + y? < a?,z = 0in the (a, y, z) plane with the 
bottom of the corresponding dise in the (x’, y’, z’) plane, and vice versa. The 
figure shows the sections y = 0, y’ = 0 of these planes. On circling twice round 
the singularity at 72+ y* = a?, z = 0 one passes from the (z, y, z) plane to the 
(a’, y’, 2’) plane (where 7 is negative) and back to the (zx, y, z) plane (where r is 
positive). 


The extension in the case a* < m? is rather more complicated, 
because of the existence of the two values r, = m+(m?—a?)? and 
r_ = m—(m?—a?)t of r at which Air) vanishes. These surfaces are 
similar to the surfaces r=r,, r=r7_ in the Reissner—Nordstrém 
solution. To extend the metric across these surfaces, one transforms 
to the Kerr coordinates (r,0,¢,,u,.), where 


du, = dt+(r?+a?)A-dr, dé, =d¢+aAdr. 
The metric then takes the form 
ds? = p?d@?— 2asin?6drd¢, +2drdu., 
+ p~?[(r? + a)? — Aa? sin? 6] sin? 6 d@,” 


— 4ap—mr sin? 6d¢, du, —(1—2mrp-*)du,? (5.31) 
6-2 
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on the manifold defined by these coordinates, and is analytic at 
r=r, andr = r_. One again has a singularity at r = 0, which has the 
same ring form and geodesic structure as that described above. The 
metric can also be extended on the manifold defined by the coordinates 
(r,0, p_, u_) where 


du_ = dt—(r?+a?)A-1dr, ddé_=dd¢—aAar; 


the metric again takes the form (5.31), with ¢,,u, replaced by — ¢_, 
—u_.The maximalanalytic extension can be built up by a combination 
of these extensions, as in the Reissner-Nordstrom case (Boyer and 
Lindquist (1967), Carter (1968a)). The global structure is very similar 
to that of the Reissner-Nordstrém solution except that one can now 
continue through the ring to negative values of r. Figure 28 (i) shows 
the conformal structure of the solution along the symmetry axis. The 
regions I represent the asymptotically flat regions in which r > r,. 
The regions II (r_<7r<r,) contain closed trapped surfaces. The 
regions III (—o <r <r_) contain the ring singularity; there are 
closed timelike curves through every point in a region III, but no 
causality violation occurs in the other two regions. 

In the case a? = m?,r, and r_ coincide and there is no region II. The 
maximal extension is similar to that of the Reissner—Nordstrém solu- 
tion when e? = m?. The conformal structure along the symmetry axis 
in this case is shown in figure 28 (ii). 

The Kerr solutions, being stationary and axisymmetric, have a 
two-parameter group of isometries. This group is necessarily Abelian 
(Carter (1970)). There are thus two independent Killing vector fields 
which commute. There is a unique linear combination K¢ of these 
Kalling vector fields which is timelike at arbitrarily large positive and 
negative values of r. There is another unique linear combination K¢ 
of the Killing vector fields which is zero on the axis of symmetry. The 
orbits of the Killing vector A® define the stationary frame, that is, an 
object moving along one of these orbits appears to be stationary with 
respect to infinity. The orbits of the Killing vector K@ are closed curves, 
and correspond to the rotational symmetry of the solution. 

In the Schwarzschild and Reissner-Nordstrém solutions, the 
Killing vector AK* which is timelike at large values of r is timelike 
everywhere in the region I, becoming null on the surfaces r = 2m and 
r = r, respectively. These surfaces are null. This means that a particle 
which crosses one of these surfaces in the future direction cannot 
return again to the same region. They are the boundary of the region 
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PS, Pare 


Ficure 28. The conformal structure of the Kerr solutions along the axis of 
symmetry, (1) in the case 0 < a? < m?, (ii) in the case a? = m?. The dotted lines 
are lines of constant 7; the regions I, II and III in case (i) are divided by 7 = r, 
and r = r_, and the regions I and III in case (ii) by r = m. In both cases, the 
structure of the space near the ring singularity is as in figure 27. 


of the solution from which particles can escape to the infinity %+ of 
a particular region I, and are called the event horizons of that 4+. (They 
are in fact the event horizon in the sense of § 5.2 for an observer moving 
on any of the orbits of the Killing vector K? in the region I.) 

In the Kerr solution on the other hand, the Killing vector K@ is 
spacelike in a region outside r = r,, called the ergosphere (figure 29). 
The outer boundary of this region is the surface r = m + (m? — a cos? 6)3 
on which Kis null. This is called the stationary limit surface since it is 
the boundary of the region in which particles travelling on a timelike 
curve can travel on an orbit of the Killing vector K*, and so remain at 
rest with respect to infinity. The stationary limit surface is a timelike 
surface except at the two points on the axis, where it is null (at these 
points it coincides with the surface r = r,,). Where it is timelike it can 
be crossed by particles in either the ingoing or the outgoing direction. 
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Ficure 29. In the Kerr solution with 0 < a? < m*, the ergosphere lies between 
the stationary limit surface and the horizon at r= r,. Particles can escape to 
infinity from region I (outside the event horizon r = r,) but not from region IT 
(between r= 7, and r=r_) and region III (r <7_; this region contains the 
ring singularity). 


@ 


; limit surface 
Ficure 30. The equatorial plane of a Kerr solution with m? > a2. The circles 


represent the position a short time later of flashes of light emitted by the points 
represented by heavy dots. 


5.6] THE KERR SOLUTION 167 


It is therefore not the event horizon for .%+. In fact the event horizon 
is the surface r = 7, = m+(m*?—a?)}. Figure 30 shows why this is. It 
shows the equatorial plane 0 = 47; each point in this figure represents 
an orbit of the Killing vector K%, i.e. it is stationary with respect 
to %*. The small circles represent the position a short time later of 
flashes of light emitted from the points represented by the heavy 
black dots. Outside the stationary limit the Killing vector K@ is time- 
like and so lies within the light cone. This means that the point in 
figure 30 representing the orbit of emission lies within the wavefront 
of the light. 

On the stationary limit surface, A“ is null and so the point repre- 
senting the orbit of emission lies on the wavefront. However the wave- 
front lies partly within and partly outside the stationary limit surface; 
it is therefore possible for a particle travelling along a timelike curve 
to escape to infinity from this surface. In the ergosphere between the 
stationary limit surface and r = r,, the Killing vector K* is spacelike 
and so the point representing the orbit of emission lies outside the 
wavefront. In this region it is impossible for a particle moving on a 
timelike or null curve to travel along an orbit of the Killing vector and 
so to remain at rest with respect to infinity. However the positions of 
the wavefronts are such that the particles can still escape across the 
stationary limit surface and so out to infinity. On the surface r = r,, 
the Killing vector A® is still spacelike. However the wavefront corre- 
sponding to a point on this surface hes entirely within the surface. 
This means that a particle travelling on a timelike curve from a point 
on or inside the surface cannot get outside the surface and so cannot 
get out to infinity. The surface r = r, is therefore the event horizon 
for %+ and is a null surface. 

Although the Killing vector K® is spacelike in the ergosphere, the 
magnitude K¢K°K,,K,, of the Killing bivector K,,4,) is negative every- 
where outside r=r_, except on the axis K* = 0 where it vanishes. 
Therefore K¢ and K@ span a timelike two-surface and so at each point 
outside 7 = 7, off the axis there is a linear combination of K* and K+ 
which is timelike. In a sense, therefore, the solution in the ergosphere 
is locally stationary, although it is not stationary with respect to 
infinity. In fact there is no one linear combination of K* and K¢ which 
is timelike everywhere outside r = 7. The magnitude of the Killing 
bivector vanishes on r = r,, and is positive just inside this surface. 
Onr = r,, both K@ and K¢ are spacelike but there is a linear combina- 
tion which is null everywhere on r = 7, (Carter (1969)). 
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The behaviour of the ergosphere and the horizon we have discussed 
will play an important part in our discussion of black holes in §9.2 
and § 9.3. 

Just as the Reissner-Nordstrém solution can be thought of as 
a charged version of the Schwarzschild solution, so there is a family of 
charged Kerr solutions (Carter (1968a)). Their global properties are 
very similar to those of the uncharged Kerr solutions. 


5.7. Gédel’s universe 


In 1949, Kurt Godel published a paper (Gédel (1949)) which provided 
a considerable stimulus to investigation of exact solutions more com- 
plex than those examined so far. He gave an exact solution of 
Einstein’s field equations in which the matter takes the form of a 
pressure-free perfect fluid (7, = pu,u, where p is the matter density 
and uw, the normalized four-velocity vector). The manifold is R* and 
the metric can be given in the form 

ds? = — di? + dz? — 4 exp (2(,/2) wx) dy? + dz? — 2 exp ((,/2) wx) dt dy, 
where w > 0 is a constant; the field equations are satisfied if u = 0/é2x° 
(i.e. u? = d4,) and 4np = w = —A. 

The constant w is in fact the magnitude of the vorticity of the flow 
vector w?. 

This space-time has a five-dimensional group of isometries which 
is transitive, 1.e. it is a completely homogeneous space-time. (An 
action of a group is transitive on -@ if it can map any point of .W into 
any other point of ./.) The metric is the direct sum of the metric §, 
given by 

ds? = — dé? + dz? — } exp (2(,/2) wx) dy?— 2. exp ((,/2) wx) dt dy 
on the manifold 4, = R® defined by the coordinates (t,x, y), and the 
metric $, given by ds,2 = dz? 


on the manifold 4, = R! defined by the coordinate z. In order to 
describe the properties of the solution it is sufficient to consider only 


(M,, oy 
Defining new coordinates (#’, r, 6) on , by 


exp ((,/2) wx) = cosh 2r + cos d sinh 2r, 
wy exp ((./2) wx) = sin d sinh 2r, 
tan 3(¢ + wt — (./2)t’) = exp (— 2r) tan 4¢, 
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the metric §, takes the form 
dsj? = 2u-*(— dt’? + dr? — (sinh*r — sinh? r) dg? + 2(,/2) sinh? r dd de), 


where —c0 <t< 0,0<r<o,and0<¢ < 27, ¢ = 0 being identified 
with ¢ = 27; the flow vector in these coordinates is u = (/ (./2)) eat’. 
This form exhibits the rotational symmetry of the solution about the 
axisr = 0. By a different choice of coordinates the axis could be chosen 
to lie on any flow line of the matter. 


iP © 
(coordinate axis) 
_ -Matter world-line 


(nd aaah p’s future null cone 


(refocusses at p”) 


He 
Null cone y 3 
s null c fi t 
Null cone tangent to at 4a 
includes circle Caustic on p's 
. }: 7 
circle\ | S | TS Null geodesics future null cone 


Null cone 
tangent to 
circle 


r > log (i+./2) LG ee a 
eee 
be 
vi, 


(eldsed fimielike r = log (1+,2) <n Bicone 
curve) \ (closed null curve)‘ cis eae E 
a 
(Vf. p’s future ee 
r < log (1+,/2)  & null cone 


(closed spacelike 
curve) 
¢ 
. 

Fictret 31. Gédel’s universe with the irrelevant coordinate z suppressed. The 
space is rotationally symmetric about any point; the diagram represents cor- 
rectly the rotational symmetry about the axis r = 0, and the time invariance. 
The light cone opens out and tips over as r increases (see line J) resulting in 
closed timelike curves. The diagram does not correctly represent the fact that 
all points are in fact equivalent. 


The behaviour of (-4,, §,) is illustrated in figure 31. The light cones 
on the axis r = 0 contain the direction 0/ét’ (the vertical direction on 
the diagram) but not the horizontal directions @/ér and 0/é¢. As one 
moves away from the axis, the light cones open out and tilt in the 
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¢-direction so that at a radius r = log(1+,/2), 0/0¢ is a null vector 
and the circle of this radius about the origin is a closed null curve. 
At greater values of r, 2/0¢ is a timelike vector and circles of constant 
r, t' are closed timelike curves. As (4, $,) has a four-dimensional 
group of isometries which is transitive, there are closed timelike curves 
through every point of (.4,, 8,), and hence through every point of the 
Godel solution (4, §). 

This suggests that the solution is not very physical. The existence 
of closed timelike curves in this solution implies that there are no 
imbedded three-dimensional surfaces without boundary in @ which 
are spacelike everywhere. For a closed timelike curve which crossed 
such a surface would cross it an odd number of times. This would mean 
that the curve could not be continuously deformed to zero, since a 
continuous deformation can change the number of crossings only by 
an even number. This would contradict the fact that -@ is simply 
connected, being homeomorphic to A+. The existence of closed time- 
like lines also shows that there can be no cosmic time coordinate tin. # 
which increases along every future-directed timelike or null curve. 

The Gédel solution is geodesically complete. The behaviour of the 
geodesics can be described in terms of the decomposition into (4, $;) 
and (4, 82). Since the metric §, of -@, is flat, the component of the 
geodesic tangent vector in .@, is constant, i.e. the z-coordmate varies 
linearly with the affine parameter on the geodesic. It is sufficient there- 
fore to describe the behaviour of geodesics in (4, $,). The null 
geodesics from a point p on the axis of coordinates (figure 31) diverge 
from the axis initially, reach a caustic at r = log (1+(,/2)), and then 
reconverge to a point p’ on the axis. The behaviour of timelike geo- 
desics is similar: they reach some maximum value of r less than 
log (1+ (/2)) and then reconverge to p’. A point g at a radius r greater 
than log (1+(,/2)) can be joined to p by a timelike curve but not by 
a timelike or null geodesic. 

Further details of Godel’s solution can be found in Gédel (1949), 
Kundt (1956). 


5-8 Taub-NUT space 


In 1951, Taub discovered a spatially homogeneous empty space solu- 
tion of Einstein’s equations with topology R x 8? and metric given by 
ds? = — U-! dt? + (21)? U(dy + cos 6 dg)? 


+ (+12) (d6? + sin?0d¢2), (5.32) 
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U(t)=—1+ Poe ™ and / are positive constants. 


Here 6, ¢, y are Euler coordinates on S3, so 0 < yy < 47,0 <6 <7, 
0 << 27. This metric is singular at t = t, =m + (m?+I?)3, where 
U = 0. It can in fact be extended across these surfaces to give a space 
found by Newman, Tamburino and Unti (1963), but before discussing 
the extension we shall consider a simple two-dimensional example 
given by Misner (1967) which has many similar properties. 

This space has the topology S! x #! and the metric g given by 


ds? = —t1d®+tdy? 


where 0 < yy < 27. This metric is singular when ¢t = 0. However if one 
takes the manifold .# defined by y% and by 0 <t < @, (4, §) can 
be extended by defining y’ = y—logt. The metric then takes the 


form ¢’ given by ds? a 2 dy’ dt +t(dy’)?. 


This is analytic on the manifold @’ with topology S! x R! defined by 
y’ and by —0% <t < o. The regiont > 0 of (.4@’, 8’) is isometric with 
(.@, 8). The behaviour of (.4@’, 8’) is shown in figure 32. There are 
closed timelike lines in the region ¢ < 0, but there are none when 
t > 0. One family of null geodesics is represented by the vertical lines 
in figure 32; these cross the surface ¢ = 0. The other family spiral 
round and round as they approach t = 0, but never actually cross this 
surface, and these geodesics have only finite affine length. Thus the 
extension (.@’, 8’) is not symmetric between the two families of null 
geodesics, although the original space (.#, 8) was. However one can 
define another extension (.#”, &”) in which the behaviour of the two 
families of null geodesics is interchanged. To do so define yy” by 
yr” = w+logt. The metric takes the form §” given by 


ds? = —2dy" dt+t(dy’)?. 


This is analytic on the manifold .@” with topology S' x R! defined 
by w” and —0 <t< oo. The region ¢ > 0 of (4, 8") is isometric 
with (4, 8). Inasense, what we have done by defining y” is to untwist 
the second family of null geodesics so that they become vertical lines, 
and can be continued beyond ¢ = 0. However this twisting winds up 
the first family of null geodesics so that they spiral around and cannot 
be continued beyond t = 0. One has therefore two inequivalent locally 
inextendible analytic extensions of (4, 8), both of which are geodesic- 
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FIGURE 32. Misner’s two-dimensional example. 

(i) Extension of region I across the boundary ¢t = 0 into II. The vertical null 
geodesics are complete, but the twisted null geodesics are incomplete. 

(11) The universal covering space is two-dimensional Minkowski space. Under 
the discrete subgroup @ of the Lorentz group, points s are equivalent; similarly 


points r, g and ¢ are equivalent. (i) is obtained by identifying equivalent points 
in regions I and II. 
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ally incomplete. The relation between these two extensions can be 
seen clearly by going to the covering space of (4, g). 

This is in fact the region I of two-dimensional Minkowski space 
(4, 4) contained within the future null cone of a point p (figure 32 (ii)). 
The isometries of (4, y) which leave p fixed form a one-dimensional 
group (the Lorentz group of 4) whose orbits are the hyperbolae 
{o = constant} where o = f?— # and 7, # are the usual Minkowski 
coordinates. The space (.@, 8) is the quotient of (J, 4) by the discrete 
subgroup G of the Lorentz group consisting of A” (n integer) where 
A maps (£, @) to 


(cosh 7 + sinh 7, cosh a+ésinhz), 
i.e. one identifies the points 
(Ecosh nz + & sinh nz, Z cosh nz +? sinh nz) 
for all integer values of n, and these correspond to the point 
t= }(P-#), p= 2arctanh(z/f) in #4. 


The action of the isometry group G in the region I is properly dis- 
continuous. The action of a group H on a manifold -/ is said to be 
properly discontinuous if: 

(1) each pointge.WV hasa neighbourhood Y such that A(YW)n V= @ 
for each A € H which is not the identity element, and 

(2) if¢,re¥ are such that there isno A €¢H with Aq = r, then there 
are neighbourhoods WY and @’ of g and r respectively such that there 
isno BeH with B(Y%)nW' + gc. 

Condition (1) implies that the quotient //H is a manifold, and 
condition (2) implies that it is Hausdorff. Thus the quotient (I, )/G is 
the Hausdorff space (-4, 8). The action of G is also properly discon- 
tinuous in the regions I+II (é > —#). Thus (I+1I,4)/@ is also a 
Hausdorff space; in fact it is (4’, 8’). Similarly (I1+III, 4)/@ is the 
Hausdorff space (#”, &”) where I+III is the region ¢ > . From this 
it can be seen how it is that one family of null geodesics can be com- 
pleted in the extension (.@’, 8’) while the other family can be com- 
pleted in the extension (.@”, 8”). This suggests that one might perform 
both extensions at the same time. However the action of the group on 
the region (I+ 11 + III) (i.e. # > — |#|) satisfies condition (1) but condi- 
tion (2) is not satisfied for points g on the boundary between I and II 
and points 7 on the boundary between I and ITI. Therefore the quotient 
(I+ 11 +III, 4)/G is not Hausdorff although it is still a manifold. 
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This kind of non-Hausdorff behaviour is different from that in the 
example given in §2.1. In that example, one could have continuous 
curves which bifurcate, one branch going into one region and another 
branch going into another region. Such behaviour of an observer’s 
world-line would be very uncomfortable. However the manifold 
(I+ 11+ III)/G does not have any such bifurcating curves; curves in I 
can be extended into II or III but not into both simultaneously. Thus 
one might be prepared to relax the Hausdorff requirement on a space— 
time model to allow this sort of situation but not the sort in which one 
gets bifurcating curves. Further work on non-Hausdorff space—times 
can be found in the papers of Hajicek (1971). 

Condition (1) is in fact satisfied by the action of G on.@ — {p}. Thus 
the space (4 — {p}, 7)/G is in some sense the maximal non-Hausdorff 
extension of (W,§). However it is still not geodesically complete 
because there are geodesics which pass through the point p which has 
been left out. If p is included the action of the group does not satisfy 
condition (1), and so the quotient .7/G is not even a non-Hausdorff 
manifold. However consider the bundle of linear frames L(-/), i.e. the 
collection of all pairs (X, ¥), X, Ye 7, of linearly independent vectors 
at all points ge.@. The action of an element A of the isometry group 
G on.@ induces an action A , on L(.#) which takes the frame (X, Y) 
at q to the frame (A, X,A, Y) at A(q). This action satisfies condi- 
tion (1) because even for (X, Y)e€7,, A,X + X and 4, Y + Y unless 
A = identity, and satisfies condition (2) even if X and Y lie on the null 
cone of p. Thus the quotient Li 7 )/G is a Hausdorff manifold. It is 
a fibre bundle over the non-Hausdorff non-manifold.#/G. One could 
in a sense regard it as the bundle of linear frames for this space. The 
fact that the bundle of frames can be well behaved even though the 
space is not, suggests that it is useful to look at singularities by using 
the bundle of linear frames. A general procedure for doing this will 
be given in §8.3. 

We shall now return to the four-dimensional Taub space (.@, §) 
where -@ is R1 x 8S? and g is given by (5.32). As-@ is simply connected, 
one cannot take a covering space as we did in the two-dimensional 
example. However one can achieve a similar result by considering @ 
as a fibre bundle over S? with fibre R!x S!; the bundle projection 
7: M > S* is defined by (E, ¥,9,)—> (0, ¢). This is in fact the pro- 
duct with the t-axis of the Hopf fibering S$— S$? (Steenrod (1951)) 
which has fibre S!. The space (.W, 8) admits a four-dimensional group 
of isometries whose surfaces of transitivity are the three-spheres 


5.8] TAUB-NUT SPACE 175 


{é = constant}. This group of isometries maps fibres of the bundle 
7: M -> S* into fibres, and so the pairs (¥, &) are all isometric, where 
F is a fibre (F¥ ~ R1x S81) and & is the metric induced on the fibre 
by the four-dimensional metric § on -@. The fibre F can be regarded 
as the (¢, y) plane, and the metric 8 on F is obtained from (5.32) by 
dropping the terms in dé and d¢; thus 8 is given by 


ds? = — U-1 dt + 42U (dy). (5.33) 


The tangent space TJ, at the point gé.W can be decomposed into 
a vertical subspace V, which is tangent to the fibre and is spanned by 
the vectors ¢/ct and ¢/éy, and a horizontal subspace H, which is 
spanned by the vectors @/¢@ and 0/¢¢ — cos 0 d/ey. Any vector XeT, 
can be split into a part X, lying in V, and a part X, lying in H,. The 
metric § on J, can then be expressed as 


g(X, Y) = gp (Xp, Yy)+ (+2) 97 (74Xq,7%Y x); (5.34) 


where gy = & and §,, is the standard metric on the two-sphere given 
by ds? = d6?+sin?6d¢?. Thus although the metric g is not the direct 
sum of g, and (#?+/?)g,, (because R! x S3 is not the direct product of 
Rx §! with S?) it can nevertheless be regarded as such a sum locally. 

The interesting part of the metric § is contained in g, and we shall 
therefore consider analytic extensions of the pair (F,8,). When com- 
bined with the metric $, of the two-sphere as in (5.34), these give 
analytic extensions of (., §). 

The metric §,, given by (5.33), has singularities at ¢ = ¢,, where 
U = 0. However if one takes the manifold ¥F, defined by y% and by 
t_<t<t,, (Fo, 8) can be extended by defining 


dat 
fey ssa 
oe SAG) 


The metric then takes the form g,’ given by 
ds? = 41 dy'(lU(t) dy’ — dé). 


This is analytic on the manifold F’ with topology S!x F defined by 
yy’ and by —00 <t < o. Theregiont_ <t < t, of (F’, $,’) is isometric 
with (Fo, $,). There are no closed timelike curves in the region 
t_<t<t, butthereare fort < t_and fort > t,. The behaviour is very 
much as for the space (.4’, 8’) we considered before, except that there 
are now two horizons (at t = t_ and ¢ = t,) instead of the one horizon 
(at t = 0). One family of null geodesics crosses both horizons ¢ = t_ and 
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t =t, but the other family spirals round near these surfaces and is 


‘creanewplete 
As before, one can make another extension by defining the 
coordinate dt 


Yo No i U(t)’ 
The metric then takes the form g,” given by 
ds? = 4 dy" (lU (t) dw” + dé) 


which is analytic on the manifold #" defined by yw” and by 
—co <t < ©, and is again isometric to (Fo, $,) ont_<t <t,. 

Once again one can show the relation between the different exten- 
sions by going to the covering space. The covering space of F 9 is the 
manifold #, defined by the coordinates —c < yr <co and by 
a js On# » the metric g, can be written in the double null form 


ds? = 422U(t) dy’ dy’, (5.35) 


where —00 < yf’ < «0, —0 < w” < o. One canextend this in a manner 
similar to that used in the Reissner—-Nordstrém solution. Define new 
coordinates (u,,v,) and (w_,v_) on F, by 
us =arctan(expy’/a.), v, = aretan(—(exp—y’/a,)), 
= 
t_—t oa 
Wace ©. = Fea Ol ae: 
n is some integer greater than (mt, +/*)/(mt_+/*). Then the metric §,- 
obtained by applying this transformation to (5.35) is analytic on the 
manifold ¥ shown in figure 33, where the coordinates (u,,v,) are 
analytic coordinates except att = t_ where they are at least C?, and the 
coordinates (w_,v_) are analytic coordinates except at ¢ = t, where 
they are at least C?. This is rather similar to the extension of the (t, 7) 
plane of the Reissner—Nordstrém solution. 

The space (¥, &-) has a one-dimensional group of isometries, the 
orbits of which are shown in figure 33. Near the points p,, p_ the 
action of this group is similar to that of the Lorentz group in two- 
dimensional Minkowski space (figure 32(ii)). Let G be the discrete 
subgroup of the isometry group generated by a non-trivial element A 
of the isometry group. The space (F), g;-) is the quotient of one of 
the regions (IL,, 8) by G. The space (¥’,§-’) is the quotient 
(I_+11, +10, -)/G, and (F”, oe ‘) is the quotient 


(e+ IL, il apie? 
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ae as 
ITi_ 
oF | = a0 
S 
¢ =—0a 
We, 
S 
Ate ” L Homogeneous surfaces 


{t = constant} (timelike 
NY § =— group orbits) : 


Homogeneous surfaces 
{t = constant} (spacelike 
group orbits) 


FicurRE 33. Penrose diagram of the maximally extended covering space of a 
two-dimensional section of Taub-NUT space, showing orbits of the isometry 
group. Taub—NUT space and its extensions are obtained from part of this space 
by identification of points under a diserete subgroup of the isometry group. 


One would also obtain a Hausdorff manifold by taking the quotient of 
(I, +11,+1_): this corresponds to extending like (¥’, §,’) at the 
surface ¢ = t, but extending like (¥”, g,") at the surface ¢t = t_. By 
taking the quotient of the whole space ¥ minus the points p, and p_ 
one obtains a non-Hausdorff manifold; and taking the quotient of F 
one obtains a non-Hausdorff non-manifold in a way analogous to that 
in the example above. As in that example, one can take the quotient 
of the bundle of linear frames over ¥ and obtain a Hausdorff manifold. 
By combining these extensions of the (é, yy) plane with the coordi- 
nates (?,¢) one can obtain corresponding extensions of the four- 
dimensional space (.@, §). In particular, the two extensions (¥', 8p’) 
and (F", g,-") give rise to two different locally inextendible analytic 
extensions of (.4, &), and both are geodesically incomplete. 
Consider one of these extensions, say (.4’, 8’). The three-spheres 
which are the surfaces of transitivity of the isometry group are space- 
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like surfaces in the region ¢_ < ¢ < t, and are timelike for ¢ > ¢,. and 
t<t_. The two surfaces of transitivity ¢=t_ and ¢=¢, are null 
surfaces and they form the Cauchy horizon of any spacelike surface 
contained in the region t_ < t < t,, because there are timelike curves 
in the regions ¢ < ¢_ and ¢ > t, which do not cross ¢ = t_ and ¢ = t, 
respectively (for example, closed timelike curves exist in the regions 
t <t_andt > t,). The region of space-time t_ < ¢ < t, is compact yet 
there are timelike and null geodesics which remain within it and are 
incomplete. This kind of behaviour will be considered further in 
chapter 8. 

Further details of Taub-NUT space may be found in Misner and 
Taub (1969), Misner (1963). 


5.9 Further exact solutions 


We have examined in this chapter a number of exact solutions and 
used them to give examples of the various global properties which we 
shall wish to discuss more generally later. Although a large number of 
exact solutions are known locally, relatively few have been examined 
globally. To complete this chapter, we shall mention briefly two other 
interesting families of exact solutions whose global properties are 
known. 5 

The first of these are the plane wave solutions of the empty space 
field equations. These are homeomorphic to R, and global coordinates 
(y, z, u, v), Which range from — co to +00, can be chosen so that the 
metric takes the form 


ds? = 2dudv+dy?+-dz?+ Aly, z, u) du?, 
where H = (y?—2?) f(u) — 2yzg(u); 


f(u) and 9(w) are arbitrary C? functions determining the amplitude 
and polarization of the wave. These spaces are invariant under a five- 
parameter group of isometries multiply transitive on the null surfaces 
{u = constant}; a special subclass, in which f(w) = cos 2u,g(u) = sin 2u, 
admit an extra Killing vector field, and are homogeneous space—times 
invariant under a six-parameter group of isometries. These spaces 
do not contain any closed timelike or null curves; however they 
admit no Cauchy surfaces (Penrose (1965a)). Local properties 
of these spaces have been studied in detail by Bondi, Pirani and 
Robinson (1959), and global properties by Penrose (1965a); Oszvath 
and Schiicking (1962) have studied global properties of the higher 
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symmetry space. The way in which two impulsive plane waves scatter 
each other and give rise to a singularity has been studied by Khan and 
Penrose (1971). 

The other is the five-parameter family of exact solutions of the 
source-free Einstein—Maxwell equations found by Carter (1968) (see 
also Demianski and Newman (1966)). These include the Schwarzschild, 
Reissner—Nordstrom, Kerr, charged Kerr, Taub—-NUT, de Sitter and 
anti-de Sitter solutions as special cases. A description of some of their 
global properties is given in Carter (1967). Some cases closely related 
to this family have been examined by Ehlers and Kundt (1962) and 
Kinnersley and Walker (1970). 


6 


Causal structure 


By postulate (a) of § 3.2, a signal can be sent between two points of 4 
only if they can be joined by a non-spacelike curve. In this chapter we 
shall investigate further the properties of such causal relationships, 
establishing a number of results which will be used in chapter 8 to 
prove the existence of singularities. 

By § 3.2, the study of causal relationships is equivalent to that of the 
conformal geometry of -@, i.e. of the set of all metrics 8 conformal to 
the physical metric § (&§ = 078, where Q is a non-zero, C” function). 
Under such a conformal transformation of the metric a geodesic curve 
will not, in general, remain a geodesic curve unless it is null, and even 
in this case an affine parameter along the curve will not remain an 
affine parameter. Thus in most cases geodesic completeness (1.e. 
whether all geodesics can be extended to arbitrary values of their 
affine parameters) will depend on the particular conformaMactor and 
so will not (except in certain special cases described in §6.4) be a 
property of the conformal geometry. In fact Clarke (1971) and Siefert 
(1968) have shown that, provided a physically reasonable causality 
condition holds, any Lorentz metric is conformal to one in which all 
null geodesics and all future-directed timelike geodesics are complete. 
Geodesic completeness will be discussed further in chapter 8 where it 
forms the basis of a definition of a singularity. 

§6.1 deals with the question of the orientability of timelike and 
spacelike bases. In $6.2 basic causal relations are defined and the 
definition of a non-spacelike curve is extended from piecewise dif- 
ferentiable to continuous. The properties of the boundary of the future 
of a set are derived in § 6.3. In § 6.4 a number of conditions which rule 
out violations or near violations of causality are discussed. The closely 
related concepts of Cauchy developments and global hyperbolicity 
are introduced in $6.5 and §6.6, and are used in §6.7 to prove the 
existence of non-spacelike geodesics of maximum length between 
certain pairs of points. 

In §6.8 we describe the construction of Geroch, Kronheimer and 

[180] 
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Penrose for attaching a causal boundary to space-time. A particular 
example of such a boundary is provided by a class of asymptotically 
flat space—times which are studied in § 6.9. 


6.1 Orientability 


In our neighbourhood of space-time there is a well-defined arrow of 
time given by the direction of increase of entropy in quasi-isolated 
thermodynamic systems. It is not quite clear what the relationship 
is between this arrow and the other arrows defined by the expansion 
of the universe and by the direction of electrodynamic radiation; the 
reader who is interested will find further discussion in Gold (1967), 
Hogarth (1962), Hoyle and Narlikar (1963) and Ellis and Sciama 
(1972). Physically it would seem reasonable to suppose that there is 
a local thermodynamic arrow of time defined continuously at every 
point of space-time, but we shall only require that it should be possible 
to define continuously a division of non-spacelike vectors into two 
classes, which we arbitrarily label future- and past-directed. If this is 
the case, we shall say that space-time is teme-orientable. In some 
space-times it is not possible to define such a time-orientation. An 
example is the space time obtained from de Sitter space (§5.2) in 
which points are identified by reflection through the origin of the five- 
dimensional imbedding space. In this space there are closed curves, 
non-homotopie to zero, on going round which the orientation of time 
is reversed. However this difficulty could clearly be resolved by simply 
unidentifying the points again, and in fact this is always the case: if 
a space-time (4,8) is not time-orientable, then it has a double 
covering space (7, 8) which is. @ may be defined as the set of all 
pairs (p, «) where pé.@ and a is one of the two orientations of time 
at p. Then with the natural structure and the projection 7: (p, «) —p, 
MM isa double covering of .4. If @ consists of two disconnected com- 
ponents then (.4, 8) is time-oriéntable. If 7 is connected, then (4, g) 
is not time-orientable but. (7, ©) is. In the following sections we shall 
assume that either (.@, 8) is time-orientable or we are dealing with the 
time-orientable covering space. If one can prove the existence of 
singularities in this space-time then there must also be singularities 
in (4, $). 

One may also ask whether space-time is space-orientable, that is 
whether it is possible to divide bases of three spacelike axes into right 


handed and left handed bases in a continuous manner. Geroch (1967a@) 
i HLS 
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has pointed out that there is an interesting connection between this 
and time-orientability which follows because some experiments 
on elementary particles are not invariant under charge or parity 
reversals, either singly or together. On the other hand there are theo- 
retical reasons for believing that all interactions are invariant under 
the combination of charge, parity and time reversals (CPT theorem; 
see Streater and Wightman (1964)). If one believes that the non- 
invariance of weak interactions under charge and parity reversals is 
not merely a local effect but exists at all points of space-time, then it 
follows that going round any closed curve either the sign of a charge, 
the orientation of a basis of spacelike axes, and the orientation of time 
must all reverse, or none of them does. (The ordinary Maxwell theory, 
in which the electromagnetic field has a definite sign at every point, 
does not allow the sign of a charge to change on going around a closed 
curve non-homotopic to zero unless the orientation of time changes. 
However one could have a theory in which the field was double-valued 
and changed sign on going round sucha curve. This theory would agree 
with all existing experimental evidence.) In particular if one assumes 
that space-time is time-orientable then it must also be space- 
orientable. (This in fact follows on using the experimental evidence 
alone without appealing to the CPT theorem.) 

Geroch (1968c) has also shown that if it is possible to define two- 
component spinor fields at every point then space-time must be 
parallelizable, that is it must be possible to introduce a continuous 
system of bases of the tangent space at every point. (Further conse- 
quences of the existence of spinor structures are obtained in Geroch 
(1970a).) 


6.2 Causal curves 


Taking space-time to be time-orientable as explained in the previous 
section, one can divide the non-spacelike vectors at each point into 
future- and past-directed. For sets Y and %, the chronological future 
L'(S, UW) of F relative to VW can then be defined as the set of all points 
in @ which can be reached from ¥ by a future-directed timelike curve 
in @. (By a curve we mean always one of non-zero extent. not just a 
single point. Thus /*(-4,%) may not contain ¥. ) I°S We) Willaie 
denoted by /'(%), and is an open set, sinee if pe. can be reached by 
av future-directed timelike curve from -¥ then there is a small neigh- 
bourhood of p which can be so reached. 
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This definition has a dual in which ‘future’ is replaced by ‘past’, 
and the + by a — ; to avoid repetition, we shall regard dual definitions 
and results as self-evident. 

The causal future of S relative to W is denoted by Fe’, WY); ites 
defined as the union of Y 9 Y with the set of all points in Y which can 
be reached from Y by a future-directed non-spacelike curve in Y. We 
saw in §4.5 that a non-spacelike curve between two points which was 
not a null geodesic curve could be deformed into a timelike curve 
between the two points. Thus if Y is an open set and p, g, re %, then 


either geJ*(p,@%), rel*(q,%) 
or qel*(p, %), reJSt+(q, Z) 
From this it follows that Fip,%) = J+(p,%) and I+(p, %) = J+(p,®) 
where for any set 4%, % denotes the closure of % and 
Kh = KH \(M—-H) 
denotes the boundary of %. 


} imply relt(p,%). 


Chronological 
future [+(Y) Null geodesic in /+(¥) 
which does not intersect 
J*(f/) and has no past 
endpoint in 4 


Point removed 


Causa] froin 


future 
J*(Ff) 


Null geodesics 
through ¥ generating 
past of J+(S) 


Ficure 34. When a point has been removed from Minkowski space, the causal 
future J*(.Y) of a closed set Y is not necessarily closed. Further parts of the 
boundary of the future of .Y may be generated by null geodesic segments 
which have no past endpoints in 4. 


As before, J+(.S, M) will be written simply as J+(.S). It is the region 
of space-time which can be causally affected by events in /. It is not 
necessarily a closed set even when Y is a single point, as figure 34 
shows. This example, incidentally, illustrates a useful technique for 
constructing space-times with given causal properties: one starts 
with some simple space-time (unless otherwise indicated this will be 
Minkowski space), cuts out any closed set and, if desired, pastes it 
together in an appropriate way (i.e. one makes identifications of points 

Fae 
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of @). The result is still a manifold with a Lorentz metric and there- 
fore still a space-time even though it may look rather incomplete 
where points have been cut out. As mentioned above, however, this 
incompleteness can be cured by an appropriate conformal trans- 
formation which sends the cut out points to infinity. 

The future horismos of F relative to YW, denoted by E*(F, %), is 
defined as J+(S¥,Y)—I+(F,U); we write H+(S) for E(S,M). (In 
some papers the relations p €1*(q), p€J+(q) and pe £*(q) are denoted 
by ¢< p,q < pand gp respectively.) If Y is an open set, points of 
E+(Sf,%) must lie on future-directed null geodesics from / by 
proposition 4.5.10, and if Y is a convex normal neighbourhood about p 
then it follows from proposition 4.5.1 that E*(p,W) consists of the 
future-directed null geodesics in Y from p, and forms the boundary in 
MN of both I*+(p,%) and J+(p,%). Thus in Minkowski space, the null 
cone of p forms the boundary of the causal and chronological futures 
of p. However in more complicated space—times this is not necessarily 
the case (e.g. see figure 34). 

For the purposes of what follows it will be convenient to extend the 
definition of timelike and non-spacelike curves from piecewise dif- 
ferentiable to continuous curves. Although such a curve may not have 
a tangent vector we can still say that it is non-spacelike if locally 
every two points of the curve can be joined by a piecewise differenti- 
able non-spacelike curve. More precisely, we shall say that a con- 
tinuous curve y: F >.4#, where F is a connected interval of R!, is 
future-directed and non-spacelike if for every te F there is a neighbour- 
hood G of t in F and a convex normal neighbourhood @ of y(t) in # 
such that for any 4,6G, y(t,)eJ-(y(t),%)—y(t) if t,<t, and 
y(t) eS * (y(t), Y@) — yt) if t < ty. We shall say that y is future-directed 
and timelike if the same conditions hold with J replaced by J. Unless 
otherwise specified, we will in future mean by a timelike or non- 
spacelike curve such a continuous curve, and shall regard two curves 
as equivalent if one is a reparametrization of the other. With this 
generalization we can establish a result that will be used repeatedly 
in the rest of this chapter. We first give a few more definitions. 

A point p will be said to be a future endpoint of a future-directed 
non-spacelike curve y: F > if for every neighbourhood % of p there 
isateF such that y(t,)e% for every t,eF witht, >t. A non-spacelike 
curve is fulure-inertendible (respectively, future-inextendible in a set S ) 
if it has no future endpoint (respectively, no future endpoint in S » 
A point p will be said to be a limit point of an infinite sequence of non- 
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spacelike curves A,, if every neighbourhood of p intersects an infinite 
number of the A,. A non-spacelike curve A will be said to be a limit 
curve of the sequence A,, if there is a subsequence A’, of the A,, such that 
for every ped, A’,, converges to p. 


Lemma 6.2.1 


Let / be an open set and let A,, be an infinite sequence of non-spacelike 
curves in S which are future-inextendible in SY. If pe isa limit point 
of A,,, then through p there is a non-spacelike curve A which is future- 
inextendible in Y and which is a limit curve of the A,,. 


It is sufficient to consider the case Y = -@ since F can be regarded as 
a manifold with a Lorentz metric. Let WY, be a convex normal co- 
ordinate neighbourhood about p and let @(qg,a) be the open ball of 
coordinate radius a about qg. Let b > 0 be such that A(p, b) is defined 
and let A(1,0),, be a subsequence of A,,1 Y%, which converges to p. 
Since A(p,b) is compact it will contain limit points of the A(1,0),. 
Any such limit point y must lie either in J~(p, Y,) or J*(p, Y,) since 
otherwise there would be neighbourhoods ¥,, of y and VY, of p between 
which there would be no non-spacelike curve in Y,. Choose 


21,63 +(p,%) n B(p,b) 


to be one of these limit points (figure 35), and choose A(1, 1), to be 
a subsequence of A(1,0),, which converges to x,,. The point 2,, will be 
a point of our limit curve A. Continue inductively, defining 


x4 eJ+(p,U) 1 Bp, i-yb) 


as a limit point of the subsequence A(i— 1,i—1), forj = 0, A(2,j7—-1), 
for i >j > 1, and defining A(i,j), as a subsequence of the above 
subsequence which converges to 2;;. In other words we are dividing 
the interval [0, 6] into smaller and smaller sections and getting points 
on our limit curve on the corresponding spheres about p. As any two 
of the x,; will have non-spacelike separation, the closure of the union 
of all the z;; (j > 1) will give a non-spacelike curve A from p = 29 to 
241 = X4;. It now remains to construct a subsequence A’, of the A, such 
that for each qéA, A’, converges to g. We do this by choosing A’,, to be 
a member of the subsequence A(m,m),, which intersects each of the 
balls B(x,,;,m 1b) for 0 <j < m. Thus A will be a limit curve of the 
A, from p to x,,. Now let Y, be a convex normal neighbourhood about 
z,, and repeat the construction using this time the sequence A’,. 
Continuing in this fashion, one can extend A indefinitely. oO 
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Null geodesics 
through p 


Figure 35. The non-spacelike limit curve A through p of a family of non- 
spacelike curves A,, for which 7 is a limit point. 


6.3 Achronal boundaries 


From proposition 4.5.1 it follows that in a convex normal neighbour- 
hood Y, the boundary of [+(p, @) or J*(p, W) is formed by the future- 
directed null geodesics from p. To derive the properties of more general 
boundaries we introduce the concepts of achronal and future sets. 

A set ¥ is said to be achronal (sometimes referred to as ‘semi- 
spacelike’ in the literature) if [*(%) 0S is empty, in other words if 
there are no two points of Y with timelike separation. S is said to be 
a future setif S > It(F). Note thatif S is a future set, W—S is a past 
set. Examples of future sets inchide J+(-4') and J*(4°), where A” is 


any set. Examples of achronal sets are given by the following 
fundamental result. 
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Proposition 6.3.1 


If F is a future set then FY, the boundary of Y, is a closed, imbedded, 
achronal three-dimensional C!~ submanifold. 


If geY, any neighbourhood of q intersects Y and W—Y. If pel*(q), 
then there is a neighbourhood of q in J~(p). Thus I+(q) < ¥. Similarly 
I-(q) < (#-S). Tf reI*(q), there is a neighbourhood VW of r such 
that ¥ < I*+(q) < FY. Thus r cannot belong to Y. One can introduce 
normal coordinates (21, x?, x3, z+) in a neighbourhood Y, about q with 
o/éx* timelike and such that the curves {x* = constant (i = 1, 2, 3)} 
intersect both I*(q, Y,) and I-(q, Y,). Then each of these curves must 
contain precisely one point of Y. The x-coordinate of these points 
must be a Lipschitz function of the x? (i = 1, 2,3) since no two points 
of SY have timelike separation. Therefore the one-one map 
Gm? (\ ve h* defined by GXp) = z'(p) (¢ = 1, 2,3) for peS 1 Y, 
is a homeomorphism. Thus (FY 0 Y,, ¢,) is a Cl atlas for #. O 


We shall call a set with the properties of Y listed in proposition 6.3.1, 
an achronal boundary. Such a set can be divided into four disjoint 
subsets A, FY, F, F as follows: for a point geFY there may or 
may not exist points p,reY with peE-(q)—q, reEt(q)—q. The 
different possibilities define the subsets of Y according to the scheme: 


Ap id 
ae 
Te malt 
If ge Se then reH#*(p) since reJ*(p) and by proposition 6.3.1, 
r¢I+(p). This means that there is a null geodesic segment in SF through 
q. If oO (respectively #) then q is the future (respectively, past) 
endpoint of a null geodesic in SF. The subset SF is spacelike (more 
strictly, acausal). These divisions are illustrated in figure 36. 

A useful condition for a point to lie in A, FY, or H is given in the 
following lemma due to Penrose (Penrose (1968)): 


Lemma 6.3.2 
Let W be a neighbourhood of geY where F is a future set. Then 
(i) I+Hqg <TS-V) implies qeHU FS, 


(ii) I-(qQ) <I-(M-S-W) implies qe Ku &. 
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Ficure 36. An achronal boundary F can be divided into four sets: is space- 
like, Yy is null, and S, (respectively, Y_) is the future (respectively, past) 
endpoint of a null geodesic in F. 


It is sufficient to prove (i) since ¥ can also be regarded as the boundary 
of the past set (W@—S). Let {a,} be an infinite sequence of points in 
I+(q) 1 W which converge on q. If I~(q) < I*(f¥ —#), there will be 
a past-directed timelike curve A, to. Y—W from each x,. By lemma 


6.2.1 there will be a past-directed limit curve A from q to (7 —#). As 
I-(q) is open and contained in.@ —F, I-(q) 1 F isempty. Thus A must 
be a null geodesic and must lie in ¥. oO 


As an example of the above results, consider J+(#) =I-(#), the 
boundary of the future of a closed set %”. By proposition 6.3.1 it is an 
achronal manifold and by the above lemma, every point of J (4) - KH 
belongs to [J+(%#)]y or [J+(#)],.. This means that J/(%)-—*# is 
generated by null geodesic segments which may have future end- 
points in J+(#)— # but which, if they do have past endpoints, can 
have them only on # itself. As figure 34 shows, there may be null 
geodesic generating segments which do not have past endpoints at all 
but which go out to infinity. This example is admittedly rather 
artificial but Penrose (1965a) has shown that similar behaviour 
occurs in something as simple as the plane wave solutions; the anti- 
de Sitter (§5.2) and Reissner-Nordstrém (§5.5) solutions provide 
other examples. We shall see in § 6.6 that this behaviour is connected 
with the absence of a Cauchy surface for these solutions. 

We shall say that an open set Y is causally simple if for every 
compact set # < Y, 


JHA) VM = EHH)VU and J-(#)0W = E(#L)0&. 
This is equivalent to saying that J+(%) and J-(#) are closed in %. 
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6.4 Causality conditions 


Postulate (a) of § 3.2 required only that causality should hold locally; 
the global question was left open. Thus we did not rule out the possi- 
bility that on a large scale there might be closed timelike curves (i.e. 
timelike Ss). However the existence of such curves would seem to 
lead to the possibility of logical paradoxes: for, one could imagine 
that with a suitable rocketship one could travel round such a curve 
and, arriving back before one’s departure, one could prevent oneself 
from setting out in the first place. Of course there is a contradiction 
only if one assumes a simple notion of free will; but this is not some- 
thing which can be dropped lightly since the whole of our philosophy 
of science is based on the assumption that one is free to perform any 
experiment. It might be possible to form a theory in which there were 
closed timelike curves and in which the concept of free will was modi- 
fied (see, for example, Schmidt (1966)) but one would be much more 
ready to believe that space-time satisfies what we shall call the 
chronology condition: namely, that there are no closed timelike curves. 
One must however bear in mind the possibility that there might be 
points (maybe where the density or curvature was very high) of 
space-time at which this condition does not hold. The set of all such 
points will be called the chronology violating set of @ and has the 
following character: 


Proposition 6.4.1 (Carter) 


The chronology violating set of -@ is the disjoint union of sets of the 
form I+(q) n I-(q), gE-4. 


If g is in the chronology violating set of .@, there must be a future- 
directed timelike curve A with past and future endpoints at gq. If 
r €I-(q) n I*(q), there will be past- and future-directed timelike curves 
fi, and pu from g tor. Then (4,)-10A 0 4, will be a future-directed time- 
like curve with past and future endpoints at r. Moreover if 


re[I-(q) nI*(q)]n L-(p) 0 £*(p)] 


then pel-(q)n I+) =I (p) nL*(p). 
To complete the proof, note that every point 7 at which chronology is 
violated is in the set J-(r) n [*(r). Oo 


Proposition 6.4.2 
If .W is compact, the chronology violating set of is non-empty. 
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AM can be covered by open sets of the form I+(q), ¢-4. If the chrono- 
logy condition holds at qg, then g¢I*(q). Thus if the chronology 
condition held at every point, could not be covered by a finite 
number of sets of the form [*(q). Oo 


From this result it would seem reasonable to assume that space-time 
is non-compact. Another argument against compactness is that any 
compact, four-dimensional manifold on which there is a Lorentz 
metric cannot be simply connected. (The existence of a Lorentz metric 
implies me the Euler number y(-@) is zero (Steenrod (1951), p. 207). 
Now y = > (—1)"B, where B,, > 0 is the nth Betti number of 4. By 
n=0 
duality (Spanier (1966), p. 297) B, = B,_,. Since By = B, = 1, this 
implies that B, + 0 which in turn implies 7,(.W) + 0 (Spanier (1966), 
p. 398).) Thus a compact space-time is really a non-compact manifold 
in which points have been identified. It would seem physically reason- 
able not to identify points but to regard the covering manifold as 
representing space-time. 
We shall say that the causality condition bau if there are no closed 
non-spacelike curves. Similar to proposition 6.4.1, one has: 


Proposition 6.4.3 


The set of points at which the causality condition does not hold i is the 
disjoint union of sets of the form J~(qg) nN J7(q), gE-4. O 


In particular, if the causality condition is violated at g¢-W but the 
chronology condition holds, there must be a closed null geodesic 
curve y through qg. Let v be an affine parameter on y (regarded as a map 
of an open interval of R' to .#) and let ...,v_,, Vp, vy, Vg; -.. be successive 
values of v at g. Then we may compare oi q the tangent ee 0/00» 

and the tangent vector @/@v|,_,,, obtained by parallelly transporting 
é/év|,-», round y. Since they both point in the same direction, they 
must jbe proportional: @/év|,_,, = @é/év|,_,,. The factor a has the 
following significance: the affine distance covered in the nth circuit of 
Y; (Uns1— Un), is equal to a~"(v, — vy). Thus ifa > 1, v never attains the 
value (v,—v9)(1—a~1)—! and so y is geodesically incomplete in the 
future direction even though one can go round an infinite number of 
times. Similarly if a < 1, y is incomplete in the past direction, while if 
a = 1, it is complete in both directions. In the two-dimensional model 
of Taub-NUT space described in § 5.7, there is a closed null geodesic 
which is an example with a > 1. Since the factor a is a conformal in- 
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variant, this incompleteness is independent of the conformal factor. 
This kind of behaviour, however, can happen only if there is a violation 
of causality in some sense; if the strong causality condition (see below) 
holds, a suitable conformal transformation of the metric will make all 
null geodesics complete (Clarke (1971)). 

The factor a has a further significance from the following result. 


Proposition 6.4.4 


If y is a closed null geodesic curve which is incomplete in the future 
direction then there is a variation of y which moves each point of y 
towards the future and which yields a closed timelike curve. 


By §2.6, one can find on -@ a timelike line-element field (V, — V) 
normalized so that g(V, V) = — 1. As we are assuming that -@ is time- 
orientable, one can consistently choose one direction of (V, — V) and 
so obtain a future-directed timelike unit vector field V. One can then 
define a positive definite metric g’ by 


g'(X, Y) = g(X, 6 oe 29(X, V) g(Y, WO 


Let t be a (non-affine) parameter on y which is zero at some point 
qey and which is such that g(V, 0/ét) = — 2-4. Then t measures proper 
distance along y in the metric §’ and has the range — 00 < ¢ < oo. Con- 
sider a variation of y with variation vector 0/éu equal to xV, where x is 
a function x(t). By § 4.5; 


ae re fe ai (aes 
Deu aa! = a ewe) Tawa a 


dx 
= —} —— 
ae) 
where f @/ét = (D/ét) (2/ét). Now suppose v were an affine parameter 


on y. Then é/év would be proportional to @/ét: @/év = hoe/ét, where 
h-1 dh/dt = —f. On going round one circuit of y, 0/év increases by 


a factor a > 1. Thur 
$ fat =—loga< 0. 
Therefore if we take x(t) to be 


t 
exp (| f(t’) dt’ + bt log a) ; 
0 


where 6 = $ dt, this will give a variation of y to the future and gives 
a closed timelike curve. oO 
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Proposition 6.4.5 
If (a) R,,K*K® > 0 for every null vector K; 

(b) the generic condition holds, i.e. every null geodesic contains a 
point at which K,,RsaeK »K°K? is non-zero, where K is the tangent 
vector; 

(c) the chronology condition holds on -@, 
then the causality condition holds on -#. 


If there were closed null geodesic curves which were incomplete, then 
by the previous result they could be varied to give closed timelike 
curves. If they were complete, then by proposition 4.4.5 they would 
contain conjugate points and so by proposition 4.5.12 they could 
again be varied to give closed timelike curves. O 


This shows that in physically realistic solutions, the causality and 
chronology conditions are equivalent. 

As well as ruling out closed non-spacelike curves, it would seem 
reasonable to exclude situations in which there were non-spacelike 
curves which returned arbitrarily close to their point of origin or 
which passed arbitrarily close to other non-spacelike curves which then 
passed arbitrarily close to the origin of the first curve—and so on. In 
fact Carter (1971a) has pointed out that there isa more than countably 
infinite hierarchy of such higher degree causality conditions depending 
on the number and order of the limiting processes involved. We shall 
describe the first three of these conditions and shall then give the 
ultimate in causality conditions. 

The future (respectively, past) distinguishing condition (Kronheimer 
and Penrose (1967)) is said to hold at p €.4 if every neighbourhood of p 
contains a neighbourhood of p which no future (respectively, past) 
directed non-spacelike curve from p intersects more than once. An 
equivalent statement is that J+(q) = I*(p) (respectively, I-(q) = I~-(p)) 
implies that g = p. Figure 37 shows an example in which the causality 
and past distinguishing conditions hold everywhere but the future 
distinguishing condition does not hold at p. 

The strong causality condition is said to hold at p if every neighbour- 
hood of p contains a neighbourhood of p which no non-spacelike curve 
intersects more than once. Figure 38 shows an example of violation of 
this condition. 
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FiGuRE 37. A space in which the causality and past distinguishing conditions 
hold everywhere, but the future distinguishing condition does not hold at » or q 
(in fact, 1+(p) = I+(q)). The light cones on the cylinder tip over until one null 
direction is horizontal, and then tip back up; a strip has been removed, thus 
breaking the closed null geodesic that would otherwise occur. 
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Ficure 38. A space-time satisfying the causality, future and past distinguish- 
ing conditions, but not satisfying the strong causality condition at p. Two 
strips have been removed from a cylinder; light cones are at + 45°. 


194 CAUSAL STRUCTURE [6.4 


Proposition 6.4.6 
If conditions (a) to (c) of proposition 6.4.5 hold and if in addition, 
(d) &@ is null geodesically complete, then the strong causality condi- 
tion holds on -4@. 


Suppose the strong causality condition did not hold at pe.#. Let Y 
be a convex normal neighbourhood of p and let V, < WY be an infinite 
sequence of neighbourhoods of p such that any neighbourhood of p 
contains all the V, for n large enough. For each V, there would be a 
future-directed non-spacelike curve A,, which left Y and then returned 
to V,. By lemma 6.2.1, there would be an inextendible non-spacelike 
curve A through p which was a limit curve of the A,,. No two points of A 
could have timelike separation as otherwise one could join up some A,, 
to give a closed non-spacelike curve. Thus A must be a null geodesic. 
But by (a), (6) and (d) A would contain conjugate points and therefore 
points with timelike separation. 0 


Corollary 


The past and future distinguishing conditions would also hold on .# 
since they are implied by strong causality. 

~ 
Closely related to these three higher degree causality conditions is 
the phenomenon of imprisonment. 

A non-spacelike curve y that is future-inextendible can do one of 
three things as one follows it to the future: it can 

(i) enter and remain within a compact set Y, 

(ii) not remain within any compact set but continually re-enter 
a compact set /, 

(i) not remain within any compact set Y and not re-enter any 
such set more than a finite number of times. 

In the third case y can be thought of as going off to the edge of 
space-time, that is either to infinity or a singularity. In the first and 
second cases we shall say that y is totally and partially future im prisoned 
in /, respectively. One might think that imprisonment could occur 
only if the causality condition was violated, but the example due to 
Carter which is illustrated in figure 39 shows that this is not the case. 
Nevertheless one does have the following result: 
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Ficvre 39. A space with imprisoned non-spacelike lines but no closed non- 
spacelike curves. The manifold is R! x S! x S! described by coordinates (t, y, 2) 
where (¢. y. z) and (¢, y.2+1) are identified, and (¢, y, z) and (y, y+1, z+a) are 
identified, where « is an irrational number. The Lorentz metric is given by 


ds? = (cosht— 1)? (dé? — dy?) + dé dy —dz?. 


(i) A section {z = constant} showing the orientation of the null cones. 
(ii) The section ¢ = 0 showing part of a null geodesic. 


Proposition 6.4.7 

If the strong causality condition holds on a compact set Y, there can 
be no future-inextendible non-spacelike curve totally or partially 
future imprisoned in . 


YF can be covered by a finite number of convex normal coordinate 
neighbourhoods Y,; with compact closure, such that no non-spacelike 
curve intersects any YW; more than once. (We shall call such neighbour- 
hoods, local causality neighbourhoods.) Any future-inextendible non- 
spacelike curve which intersects one of these neighbourhoods must 
leave it again and not re-enter it. 0 


Proposition 6.4.8 

If the future or past distinguishing condition holds on a compact 
set Y, there can be no future-inextendible non-spacelike curve 
totally future imprisoned in /. (This result is included for its interest 
but is not needed for what follows.) 


Let {Y,}, (w = 1,2,3,...), be a countable basis of open sets for 
(i.e. any open set in & can be represented as a union of the 7,). As 
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the future or past distinguishing condition holds on /, any point 
pe will have a convex normal coordinate neighbourhood Y such 
that no future (respectively, past) directed non-spacelike curve from p 
intersects WY more than once. We define f(p) to be equal to the least 
value of a such that ¥, contains p and is contained in some such 
neighbourhood @%. 

Suppose there were a future-inextendible non-spacelike curve A 
which was totally future imprisoned in . Let geEA be such that 
N' = An J*(q) is contained in Y. Define .% to be the closed, non- 
empty set consisting of all points of Y which are limit points of A. Let 
Py € BH, be such that f(p,) is equal to the smallest value of f(p) on %. 
Through p, there would be an inextendible non-spacelike curve 7 
every point of which was a limit point of A’. No two points of y, could 
have timelike separation since otherwise some segment of A’ could be 
deformed to give a closed non-spacelike curve. Thus y) would be an 
inextendible null geodesic which was totally imprisoned in in both 
the past and future directions. Let <4 be the closed set consisting of 
all limit points of yy) N J*(po) (or, in the case that the past distinguishing 
condition holds on Y, yy J-(po)). As every such point would also be 
a limit point of A’, , © WH). Since Yj, ) could contain no limit point 
of yy N J*(po) (respectively, yo N J~(po)), A would be strictly smaller 
than ). We would thus obtain an infinite sequence of Mosed sets 
Ly > A, > A,>...> B,>.... Each .%, would be non-empty, 
being the set of all limit points of the totally future (respectively, past) 
imprisoned null geodesic Yp-1 0S *(p,_1) (respectively, Vp-19 I~(Pp_s))- 
Leiee — (1p As # is compact, would be non-empty since the 


intersection of any finite number of the <A, would be non-empty 
(Hocking and Young (1961), p. 19). Suppose r¢.%. Then f(r) = f(p,) 
for some f. But Yj.) .%,, would be empty so 7 could not be in 
,,, and so could not be in #. This shows that there can be no future- 
inextendible non-spacelike curve totally future imprisonedin Y. O 


The causal relations on (.W, 8) may be used to put a topology on @ 
called the Alexandrov topology This is the topology in which a set is 
defined to be open if and only if it is the union of one or more sets of the 
form I+(p)n I-(q), p, gE. As I+(p)n I~(q) is open in the manifold 
topology, any set which is open in the Alexandrov topology will be 
open in the manifold topology, though the converse is not necessarily 
true. 


Suppose however that the strong causality condition holds on .4. 
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Then about any point re. one can find a local causality neighbour- 
hood %. The Alexandrov topology of (W,8|,) regarded as a space— 
time in its own right, is clearly the same as the manifold topology of %. 
Thus the Alexandrov topology of .W is the same as the manifold 
topology since -@ can be covered by local causality neighbourhoods. 
This means that if the strong causality condition holds, one can 
determine the topological structure of space-time by observation of 
causal relationships. 
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Ficure 40. A space satisfymg the strong causality condition, but in which 
the slightest variation of the metric would permit there to be closed timelike 
lines through p. Three strips have been removed from a cylinder; light cones 
are at + 45°. 


Even imposition of the strong casuality condition does not rule out 
all causal pathologies, as figure 40 shows one can still have a space— 
time which is on the verge of violating the chronology condition in that 
the slightest variation of the metric can lead to closed timelike curves. 
Such a situation would not seem to be physically realistic since 
General Relativity is presumably the classical limit of some, as yet 
unknown, quantum theory of space-time and in such a theory the 
Uncertainty Principle would prevent the metric from having an exact 
value at every point. Thus in order to be physically significant, a 
property of space-time ought to have some form of stability, that is 
to say, it should also be a property of ‘nearby’ space—times. In order 
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to givea precise meaning to ‘nearby’ one has to define a topology on the 
set of all space-times, that is, all non-compact four-dimensional mani- 
folds and all Lorentz metrics on them. We shall leave the problem of 
uniting in one connected topological space manifolds of different 
topologies (this can be done); and shall just consider putting a topology 
on the set of all C7 Lorentz metrics (7 > 1) on a given manifold. There 
are various ways in which this can be done, depending on whether one 
requires a ‘nearby’ metric to be nearby in just its values (C° topology) 
or also in its derivatives up to the kth order (C* topology) and whether 
one requires it to be nearby everywhere (open topology) or only on 
compact sets (compact open topology). 

For our purposes here, we shall be interested in the C® open topology. 
This may be defined as follows: the symmetric tensor spaces 7,9(p) of 
type (0,2) at every point peé.W form a manifold (with the natural 
structure) 7.9(.4), the bundle of symmetric tensors of type (0, 2) over 
M. A Lorentz metric g on -@ is an assignment of an element of T,$(-#) 
at each point p€.@ and so can be regarded as a map or cross-section 
9: A> T.$(-) such that 709 = 1 where7ris the projection T,3(4#)—> @ 
which sends xE7,$(p) to p. Let Y be an open set in 7,$(-#) and let 
O(2) be the set of all C° Lorentz metrics § such that 9(-@) is contained 
in Y (figure 41). Then the open sets in the C® open topology of the C’ 
Lorentz metrics on -@ are defined to be the union of one of more sets 
of the form O(%). 

We say that the stable causality condition holds on -@ if the space- 
time metric § has an open neighbourhood in the C® open topology 
such that there are no closed timelike curves in any metric belonging 
to the neighbourhood. (It would not make any difference if one used 
the C* topology here, but one could not use a compact open topology 
since in that topology each neighbourhood of any metric contains 
closed timelike curves.) In other words, what this condition means is 
that one can expand the light cones slightly at every point without 
introducing closed timelike curves. 


Proposition 6.4.9 


The stable causality condition holds everywhere on .# if and only if 
there is a function f on @ whose gradient is everywhere timelike. 


Remark. The function f can be thought of as a sort of cosmic time in 
the sense that it increases along every future-directed non-spacelike 
curve. 
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Proof. The existence of a function f with an everywhere timelike 
gradient implies the stable causality condition since there can be no 
closed timelike curves in any metric h which is sufficiently close to 
that for every point p €.%, the null cone of p in the metric h intersects 
the surface {f = constant} through p only at p. To show that the con- 
verse is true we introduce a volume measure uz (unrelated to the volume 
measure defined by the metric §) on -@ such that the total volume of 


Tsi(p) 
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FicureE 41. An open set Y in the C® open topology on the space 7'9(.@) of 
symmetric tensors of type (0, 2) on .@. 


M is one. One way of doing this is as follows: choose a countable atlas 
(W.,9,) for @ such that ¢,(Y,) is compact in R*. Let “, be the natural 
Euclidean measure on #4 and let f, be a partition of unity for the atlas 
(W., G~)- Then # may be defined as >) f, 2-*[u#o(%..)] 1 G.* Ho- 


Now if the stable causality condition holds one can find a family of 
Ct Lorentz metrics h(a), ae [0, 3], such that: 

(1) h(0) is the space-time metric §; 

(2) there are no closed timelike curves in the metric h(a) for each 
ae[0, 3]; 

(3) if a,, a,€[0, 3] with a, < a, then every non-spacelike vector in 
the metric h(a,) is timelike in the metric h(a,). 
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For pc.&, let 0(p, a) be the volume of I~(p, 4, h(a)) in the measure 
js where we use 1-(.Y,%,h) to denote the past of F relative to Y in 
the metric h. For a given value of a€ (0, 3), O(p, a) will be a bounded 
function which increases along every non-spacelike curve. It may not, 
however, be continuous: as figure 42 shows, it may be possible that 
a slight alteration of position may allow one to see past an obstruction 
and so greatly increase the volume of the past. One thus needs some 
way of smearing out 6(p, a) so as to obtain a continuous function which 
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Ficure 42. A small displacement of a point from p to q results in a large change 
in the volume of the past of the point. Light cones are at +45° anda strip has 
been removed as shown. 


increases along every curve which is future-directed and non-spacelike 
in the metric h(0). One can do this by averaging over a range of a: let 


m 2 
B(p) = { -(p,0) da 


We shall show that 6(p) is continuous on -#. 

First to show that it is upper semi-continuous: givene > 0, let # be 
a ball about p such that the volume of # in the measure /4 is less than 
2€. By property (3), for ay, a,€[0,3] with a, <a, one can find a 
neighbourhood F (a,, a.) of p in B such that 


[I-(F (a, a2), B, (a,)) NB] < [I-(p, B, (ay) 0 B). 


Let x be a positive integer greater than 2e-1, Then we define the set G 
to be = (VF (1 le? 1+3(¢+1)n—),7=0,1,..., 2n. G will be 
v 
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a neighbourhood of p and will be contained in F(a, a+n—) for any 
a@e[1, 2]. Therefore [-(q,-4, h(a)) —Z will be contained in 


I-(p,M,h(a+n))—-B for geY and ae[i,2]. 
Thus A(q,a) < O(p,a+ fe) +e 


and so 6(q) < 6(p)+e, showing that 6 is upper semi-continuous. The 
proof that it is lower semi-continuous is similar. To obtain a differenti- 
able function one can average 6 over a neighbourhood of each point 
with a suitable smoothing function. By taking the neighbourhood 
small enough one can obtain a function f which has everywhere a time- 
like gradient in the metric g. Details of this smoothing procedure are 
given in Seifert (1968). Oo 


The spacelike surfaces {f = constant} may be thought of as surfaces 
of simultaneity in space-time, though of course they are not unique. 
If they are all compact they are all diffeomorphic to each other, but 
this is not necessarily true if some of them are non-compact. 


6.5 Cauchy developments 


In Newtonian theory there is instantaneous action-at-a-distance and 
so in order to predict events at future points in space-time one has to 
know the state of the entire universe at the present time and also to 
assume some boundary conditions at infinity, such as that the 
potential goes to zero. In relativity theory, on the other hand, it 
follows from postulate (a) of §3.2 that events at different points of 
space-time can be causally related only if they can be joined by a 
non-spacelike curve. Thus a knowledge of the appropriate data on 
a closed set ¥ (if one knew data on an open set, that on its closure 
would follow by continuity) would determine events in a region D*(S) 
to the future of ¥ called the future Cauchy development or domain of 
dependence of Y, and defined as the set of all points pe.W such that 
every past-inextendible non-spacelike curve through p intersects / 
(NBS) i7) 7): 

Penrose (1966, 1968) defines the Cauchy development of S slightly 
differently, as the set of all points pe.@ such that every past- 
inextendible timelike curve through p intersects Y. We shall denote 
this set by D+(7). One has the following result: 
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Proposition 6.5.1 
Dt(f) = D*(F). 


Clearly Dt(S) > D*(F). lfge HW — D*(F) there is a neighbourhood Y 
of q which does not intersect Y. From q there is a past-inextendible 
curve A which does not intersect S. If rea n I-(g, Y) then I*(r, Z) is 
an open neighbourhood of g in M —Dt(S). Thus #—D+(F) is open 
and the set D+(S) is closed. Suppose there were a point peDt(F) 
which had a neighbourhood Y which did not intersect D*(/). Choose 
a point «el-(p,W). From x there would be a past-inextendible non- 
spacelike curve y which did not intersect Y. Let y, be a sequence of 
points on y which did not converge to any point and which were such 
that y,,,; was to the past of y,. Let ¥,, be convex normal neighbour- 
hoods of the corresponding points y, such that ¥ ,,., did not intersect 
WW, Let z, be a sequence of points such that 


Geet (Yaa e ey) q I-(¢,,M-F). 


There would be an inextendible timelike curve from p which passed 
through each point z, and which did not intersect “. This would con- 
tradict pe D+(.S). Thus D*(f) is contained in the closure of D7(F), 


and so Dt(S) = D*(f). O 


The future boundary of D*+(S), that is D*(/) — I-(D*+(F)), marks the 
limit of the region that can be predicted from knowledge of data on %. 
We call this closed achronal set the future Cauchy horizon of F and 
denote it by H+(.). As figure 43 shows, it will intersect F if F is null 
or if S has an ‘edge’. To make this precise we define edge (.) for an 
achronal set / as the set of all points g¢ Y such that in every neigh- 
bourhood Y of q there are points pel-(q, ¥) and reI*(q, WY) which 
can be joined by a timelike curve in YW which does not intersect Y. By 
an argument similar to that in proposition 6.3.1 it follows that if 
edge (/) is empty for a non-empty achronal set Y, then F is a three- 
dimensional imbedded C!~ submanifold. 


Proposition 6.5.2 
For a closed achronal set ¥, 


‘ 


edge (H*(.S)) = edge(.F). 
Let %,, be a sequence of neighbourhoods of a point geedge (H+(S )) 
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such that any neighbourhood of g encloses all the U,, for n sufficiently 
large. In each &, there will be points p, €1-(q, U,) and r,élt(q,%,,) 
which can be joined by a timelike curve A,, which does not intersect 
el Fy, This means that A,, cannot intersect D+(). By proposition 
6.5.1, ge D*(S) and so I-(q) < I-(D+(F)) < I-(F) u D+(F). Thus p, 
must lie in J~(.7). Also every timelike curve from g which is inextend- 
ible in the past direction must intersect Y. Therefore for each n, there 


HHS) and F 


Remove % 


H+(¥) and F 


Edge (7) and S and H+(F) 
edge H+(F) 


a 


Edge (/) and 
edge (H*()) 


F 


Ficure 43. The future Cauchy development D+() and future Cauchy horizon 
H~(/) of a closed set Y which is partly null and partly spacelike. Note that 
H*+*(f) is not necessarily connected. Null lines are'at + 45° and a strip has been 
removed. 


must be a point of Y on every timelike curve in Y, between q and p,, 
and so g must lie in ¥. As the curves A,, do not intersect , g lies in 
edge(.). The proof the other way round is similar. oO 


Proposition 6.5.3 

Let ¥ be a closed achronal set. Then H*() is generated by null 
geodesic segments which either have no past endpoints or have past 
endpoints at edge (7). 


The set F = D+(S) UI-(P#) is a past set. Thus by proposition 6.3.1 
F is an achronal C- manifold. H+(V) is a closed subset of F. Let ¢ be 
a point of Ht(/)—edge(S). If g is not in Y then gel*(/) since 
qe D*(F). As # is achronal one can find a convex normal neighbour- 
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hood W of g which does not intersect I[-(S). Alternatively if g isin J, 
let W be a convex normal neighbourhood of g such that no point of 
I+(q, W) can be joined to any point in I-(q,W) by a timelike curve 


in W which does not intersect /. 
In either case, if p is any point in 
I*(q) there must be a past-directed time- 
like curve from p to some point of 
M—-F—W since otherwise p would be 
in Dt(S). Therefore by condition (i) of 
lemma 6.3.2, applied to the future set 
M~F,qeF_ VF. a 


Corollary 

If edge (7) vanishes, then H*(S), if non- 
empty, is an achronal three-dimensional 
imbedded C!~ manifold which is gener- 
ated by null geodesic segments which 
have no past endpoint. 


We shall call an acausal set Y with no 
edge, a partial Cauchy surface. That is, a 
partial Cauchy surface is a spacelike 
hypersurface which no non-spacelike 
curve intersects more than once. Suppose 
there were a connected spacelike hyper- 
surface S (with no edge) which some non- 
spacelike curve A intersected at points p, 
and p,. Then one could join p, and p, by 
a curve “in S and u U A would be a closed 
curve which crossed Y once only. This 
curve could not be continuously deformed 
to zero since such a deformation could 
change the number of times it crossed Y 
by an even number only. Thus .# could 
not be simply connected. This means we 
could ‘unwrap’ .@ by going to the simply 


FiIGuRE 44. ¥Y is a connected 
spacelike hypersurface without 
edge in .#. It is not a partial 
Cauchy surface; however each 
image 7-'(/) of F in the uni- 
versal covering manifold .W of 
-#, is a partial Cauchy surface 


in &. 


connected universal covering manifold .# in which each connected 
component of the image of ¥ is a spacelike hypersurface (with no 
edge) and is therefore a partial Cauchy surface in /@ (figure 44). How- 
ever going to the universal covering manifold may unwrap -# more 
than is required to obtain a partial Cauchy surface and may result in 
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the partial Cauchy surface being non-compact even though Y was 
compact. For the purposes of the following chapters we would like 
a covering manifold which unwrapped @ sufficiently so that each con- 
nected component of the image of Y was a partial Cauchy surface but 
so that each such component remained homeomorphic to . Such a 
covering manifold may be obtained in at least two different ways. 

Recall that the universal covering manifold may be defined as the 
set of all pairs of the form (p,[A]) where p¢.W and where [A] is an 
equivalence class of curves in .@ from some fixed point ge.4 to p, 
which are homotopic modulo q and p. The covering manifold -4,, is 
defined as the set of all pairs (p, [A]) where now [A] is an equivalence 
class of curves from Y to p homotopic modulo ¥ and 7p (i.e. the end- 
points on Y can be slid around). -@, may be characterized as the 
largest covering manifold such that each connected component of the 
image of SY is homeomorphic to Y. The covering manifold 4H, 
(Geroch (19676)) is defined as the set of all pairs (p, [A]) where this 
time [A] is an equivalence class of curves from a fixed point ¢ to p 
which cross Y the same number of times, crossings in the future direc- 
tion being counted positive and those in the past direction, negative. 
M, may be characterized as the smallest covering manifold in which 
each connected component of the image of Y divides the manifold into 
two parts. In each case the topological and differential structure of the 
covering manifold is fixed by requiring that the projection which maps 
(p, [A]) to x is locally 4 diffeomorphism. 

Define D(Y) = Dt(S) U D-(F). A partial Cauchy surface is said 
to be a global Cauchy surface (or simply, a Cauchy surface) if D(S) 
equals @. That is, a Cauchy surface is a spacelike hypersurface which 
every non-spacelike curve intersects exactly once. The surfaces 
{a4 = constant} are examples of Cauchy surfaces in Minkowski space, 
but the hyperboloids 


{(x4)? — (a3)? — (x?)?— (21)? = constant} 


are only partial Cauchy surfaces since the past or future null cones of 
the origin are Cauchy horizons for these surfaces (see §5.1 and 
figure 13). Being a Cauchy surface is a property not only of the surface 
itself but also of the whole space-time in which it is imbedded. For 
example, if one cuts a single point out of Minkowski space, the 
resultant space-time admits no Cauchy surface at all. 

If there were a Cauchy surface for -4, one could predict the state of 
the universe at any time in the past or future if one knew the relevant 
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data on the surface. However one could not know the data unless one 
was to the future of every point in the surface, which would be impos- 
sible in most cases. There does not seem to be any physically com- 
pelling reason for believing that the universe admits a Cauchy surface; 
in fact there are a number of known exact solutions of the Einstein 
field equations which do not, among them the anti-de Sitter space, 
plane waves, Taub—NUT space and Reissner—Nordstrém solution, all 
described in chapter 5. The Reissner—Nordstrém solution (figure 25) 
is a specially interesting case: the surface Y shown is adequate for 
predicting events in the exterior regions I where r > r, and in the 
neighbouring region II where r_ < r < 7,, but then there is a Cauchy 
horizon at r = r_. Points in the neighbouring region III are not in 
D*() since there are non-spacelike curves which are inextendible in 
the past direction and which do not cross r = r_ but approach the 
points 7+ (which may be considered to be at infinity) or the singularity 
atr = 0 (which cannot be considered to be in the space-time; see § 8.1). 
There could be extra information coming in from infinity or from the 
singularity which would upset any predictions made simply on the 
basis of data on /. Thus in General Relativity one’s ability to predict 
the future is limited both by the difficulty of knowing data on the 
whole of a spacelike surface and by the possibility that even if one did 
it would still be insufficient. Nevertheless despite theseNimitations 
one can still predict the occurrence of singularities under certain 
conditions. 


6.6 Global hyperbolicity 


Closely related to Cauchy developments is the property of global 
hyperbolicity (Leray (1952)). A set -4is said to be globally hyperbolic 
if the strong causality assumption holds on.4 and if for any two points 
p.qe, J+(p) 0 J~(q) is compact and contained in -{~. Ina sense this 
can be thought of as saying that J‘(p)n J-(q) does not contain any 
points on the edge of space-time, i.e. at infinity or at a singularity. 
The reason for the name ‘global hyperbolicity’ is that on , the wave 
equation for a é-function source at pet” has a unique solution which 
vanishes outside W—J+(p,.4) (see chapter 7). 

Recall that —4 is said to be causally simple if for every compact set 
H# contained in A’, J+(#) nV and J-(#)9-¥ are closed in W. 
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Proposition 6.6.1 
An open globally hyperbolic set .4” is causally simple. 


Let p be any point of 4”. Suppose there were a point 
qe (J*(p)—I*(p)) nV. 

As A is open, there would be a point re(I+(q)n -“”). But then 
qeJ*(p)n I(r), which is impossible as J+(p) 9 J-(r) would be compact 
and therefore closed. Thus J*(p) ) 4° and J-(p) nV are closed in /. 

Now suppose there exists a point ge (J+(#) —J+(#))n . Let q, 
be an infinite sequence of points in I+(qg)n-V converging to q, with 
Qn+1 €1~(q,,). For each n, J-(q,)N # would be a compact non-empty 
set. Therefore 1) {J~(q,) “} would be a non-empty set. Let p be a 


point of this set. Then J+(p) would contain q,, for all n. But J+(p) is 
closed. Therefore J*+(p) contains q. O 


Corollary 
If #, and #2 are compact sets in VY, J+(%) n J-(&) is compact. 
One can find a finite number of points p;¢.W such that 
{UI*(p,)} > 4. 
Similarly, there will be a finite number of points q¢, with %3 contained in 
UJ~(q,). 
J 
Then J+(.%) n J-(#,) will be contained in 
U {J*(p,) nN J~(q,)} 


4,9 


and will be closed. O 


Leray (1952) did not, in fact, give the above definition of global 
hyperbolicity but an equivalent one which we shall present: for points 
p,qgE such that strong causality holds on J*(p)n J~(q), we define 
C(p,q) to be the space of all (continuous) non-space-like curves from 
p to q, regarding two curves y(t) and A(u) as representing the same 
point of C(p, q) if one is a reparametrization of the other, 1.e. if there is 
a continuous monotonic function f(u) such that y(f(w)) = A(w). 
(C(p, q) can be defined even when the strong causality condition does 
not hold on J*+(p) n J~(q), but we shall only be interested in the case in 
which its does hold.) The topology of C(p, ) is defined by saying that 
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a neighbourhood of y in C(p,q) consists of all the curves in C(p,q) 
whose points in -# lie in a neighbourhood W of the points of y in -@ 
(figure 45). Leray’s definition is that an open set V is globally hyper- 
bolic if C(p, g) is compact for all p,geV. These definitions are equi- 
valent, as is shown by the following result. 


a 


“Nie 


Ficure 45. A neighbourhood W of the points of y in. #. A neighbourhood of y 
in C(p, q) consists of all non-spacelike curves from p to q whose points liein #. 


Proposition 6.6.2 (Seifert (1967), Geroch (19706)). = 
Let strong causality hold on an open set.#” such that 


WN =I-(N)0 FY). 


Then / is globally hyperbolic if and only if C(p, q) is compact for all 
P,GEN. 


Suppose first that C(p, q) is compact. Let 7,, be an infinite sequence of 
points in J+(p)n J~(q) and let A,, be a sequence of non-spacelike curves 
from p to g through the corresponding r,,. As C(p, g) is compact, there 
will be a curve A to which some subsequence A’,, converges in the 
topology on C(p, q). Let YW be a neighbourhood of A in.@ such that Y is 
compact. Then Y will contain all A’,, and hence all 7’, for n sufficiently 
large, and so there will be a point 7 € Y which is a limit point of the 7’,. 
Clearly r lies on A. Thus every infinite sequence in J*+(p)n J~(q) has a 
limit point in J*(p)n J-(q). Hence J+(p)n J~(q) is compact. 
Conversely, suppose J+(p)n J~(q) is compact. Let A,, be an infinite 
sequence of non-spacelike curves from p to g. By lemma 6.2.1 applied 
to the open set #—gq, there will be a future-directed non-spacelike 
curve A from p which is inextendible in # — gq, and is such that there is 
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a subsequence A’,, which converges to 7 for every r€A. The curve A 
must have a future endpoint at ¢g since by proposition 6.4.7 it cannot be 
totally future imprisoned in the compact set J+(p)n J-(q), and it 
cannot leave the set except at q. 

Let Y be any neighbourhood of A in W and let r, (1 <i < k) bea 
finite set of points on A such that r, = p, r, = q and each r, has a 
neighbourhood ¥; with J+(V;)n J-(¥;,,) contained in Y. Then for 
sufficiently large n, A’,, will be contained in Y. Thus A’,, converge to A 
in the topology on C(p, q) and so C(p, g) is compact. Oo 


The relation between global hyperbolicity and Cauchy developments 
is given by the following results. 


Proposition 6.6.3 
If Y is a closed achronal set, then int (D(Y)) = D(.Y)—D(PF), if non- 
empty, is globally hyperbolic. 


We first establish a number of lemmas. 


Lemma 6.6.4 

If pe D*(S)—H*(F), then every past-inextendible non-spacelike 
curve through p intersects [-(). 

Let p be in Dt(.Y)—H*(/) and let y be a past-inextendible non- 
spacelike curve through p. Then one can find a point ge Dt (.Y) qn I+(p) 
and a past-inextendible non-spacelike curve A through q such that for 
each point x A there is a point yey with yel-(x). As A will intersect 
FY at some point x, there will be a y,Eeyn I-(7). Oo 


Corollary 

If peint (D(¥)) then every inextendible non-spacelike curve through 
p intersects I-(/) and I*(f). 

int(DF))= DA\jAtSf)vua(f)}. I pel) or I-(/) the 
result follows immediately. If pe D+(.Y) —I*(f/) then peY © D-(F) 
and the result again follows. O 


Lemma 6.6.5 
The strong causality condition holds on int D(7). 
Suppose there were a closed non-spacelike curve A through 


peint(D(Y)). By the previous result there would be points 
qeAn I-(F) and rean I+(S). As reJ~(q), it would also be in /~(7) 
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which would contradict the fact that Y is achronal. Thus the causality 
condition holds on int (D(.A)). Now suppose that the strong causality 
condition did not hold at p. Then as in lemma 6.4.6 there would be an 
infinite sequence of future-directed non-spacelike curves A, which 
converged to an inextendible null geodesic y through p. There would 
be points geyn I-(f) and reyn I+(f/) and so there would be some 
A,, which intersected I[+(S) and then J-(#), which would contradict 
the fact that Y was achronal. O 


Proof of proposition 6.6.3. We wish to show that C(p,q) is compact 
for p,qeint (D(f)). Consider first the case that p,qgel~(/) and sup- 
pose peJ~(q). Let A,, be an infinite sequence of non-spacelike curves 
from g to p. By lemma 6.2.1 there will be a future-directed non- 
spacelike limit curve from p which is inextendible in .4 — qg. This must 
have a future endpoint at q since otherwise it would intersect Y which 
would be impossible as gel-(f). Consider now the case that 
ped(Ff), qedt(S) J*(p). If the limit curve A has an endpoint at q, 
it is the desired limit point in C(p,q). If it does not have an endpoint 
at g, it would contain a point yel+(S) since it is inextendible in 
M —q. Let A’,, be a subsequence which converges to ry for every point 7 
on A between p and y. Let A be a past-directed limit curve from q of 
the X’,,. fA has a past endpoint at p, it would be the desired Umit point 
in C(p, q). fA passed through y, it could be joined up with A to provide 
a non-spacelike curve from p to g which would be the desired limit 
point in C(p,q). Suppose 4 does not have endpoint at p and does not 
pass through y. Then it would contain some point z¢1-(.S). Let A", be 
a subsequence of the A’,, which converges to r for every point r on A 
between g and z. Let Y be an open neighbourhood of A which does not 
contain y. Then for sufficiently large n, all A”,,9 J+(S) would be con- 
tained in ¥”. This would be impossible as y is a limit point of the 2”,,. 
Thus there will be a non-spacelike curve from p to g which is a limit 
point of the A,, in C(p, q). 

The cases p,gel(S) and peJ-(S), gqeJ*(F) together with their 
duals cover all possible combinations. Thus in all cases we get a non- 
spacelike curve from p to g which is a limit point of the A,, in the 
topology on C(p, q). Oo 


By a similar procedure one can prove: 


Proposition 6.6.6 
If geint (D(%)), then J+(H) Nn J-(qg) is compact or empty. O 
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‘To show that the whole of D(.) and not merely its interior is globally 
hyperbolic, one has to impose some extra conditions. 


Proposition 6.6.7 
If F is a closed achronal set such that J+(.Y)n J-(f) is both strongly 
causal and either 
(1) acausal (this is the case if and only if Y is acausal), or 
(2) compact, 
then D(Y) is globally hyperbolic. 


Suppose that strong causality did not hold at some point ge D(.F). 
Then by an argument similar to lemma 6.6.5, there would be an 
inextendible null geodesic through g at each point of which strong 
causality did not hold. This is impossible, since it would intersect /. 
Therefore strong causality holds on D(.). 

Ifp,qel-(S), the argument of proposition 6.6.3 holds. If peJ-(S), 
qeJ*(F) one can as in proposition 6.6.3 construct a future-directed 
limit curve A from p and a past-directed limit curve A from q, and 
choose a subsequence A”,, which converges to r for every point 7 on 
A or A. In case (1), A would intersect Y in a single point x. Any neigh- 
bourhood of x would contain points of A", for n sufficiently large, and 
so would contain «”,,, defined as A”, nN ¥, since Y is achronal. Therefore 
a”, would converge to x. Similarly x”,, would converge to 2 =An &. 
Thus # = x and so one could join A and A to give a non-spacelike limit 
curve in O(p, q). 

In case (2), suppose that A did not have a future endpoint at q. 
Then A would leave J~(S) since it would intersect Y and by proposi- 
tion 6.4.7 it would have to leave the compact set J+(SY) ) J-(S). Thus 
one could find a point x on A which was not in J-(#). For each n, 
choose a point x”,E Sn A",. Since F is compact, there will be some 
point yc and a subsequence A”, such that the corresponding points 
a”, converge to y. Suppose that.y does not lie on A. Then for suffi- 
ciently large n each x",, would lie to the future of any neighbourhood 
UH of x. This would imply xe J~(SF). This is impossible as x is in J*(.F) 
but is not in the compact set J+(S)n J-(F). Therefore A would pass 
through y. Similarly A would pass through y. One could then join them 
to obtain a limit curve. oO 


Proposition 6.6.3 shows that the existence of a Cauchy surface for an 
open set / implies global hyperbolicity of 4”. The following result 
shows that the converse is also true: 
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Proposition 6.6.8 (Geroch (1970b)) 

If an open set./ is globally hyperbolic, then /, regarded as a mani- 
fold, is homeomorphic to Rix SY where / is a three-dimensional 
manifold, and for each ae R}, {a} x F is a Cauchy surface for /. 


As in proposition 6.4.9, put a measure w on such that the total 
volume of .W in this measure is one. For pe.V define f*(p) to be the 
volume of J+(p,W%) in the measure yw. Clearly ft(p) is a bounded 
function on which decreases along every future-directed non- 
spacelike curve. We shall show that global hyperbolicity implies that 
f+(p) is continuous on / so that we do not need to ‘average’ the 
volume of the future as in proposition 6.4.9. To do this it will be suffi- 
cient to show that f+(p) is continuous on any non-spacelike curve A. 

Let reA and let x, be an infinite sequence of points on A strictly to 
the past ofr. Let F be Q J*(x,, WV ). Suppose that f+(p) was not upper 


semi-continuous on A at r. There would be a point ge F —J*(r,.4°). 
Then r¢J-(q, /); but each x, €J-(q,.”) and so reJ—(q,-V), which 
is impossible as J~(q, 4) is closed in 4” by proposition 6.6.1. The 
proof that it is lower semi-continuous is similar 

As p is moved to the future along an inextendible non-spacelike 
curve A in. the value of f*(p) must tend to zero. For suppose there 
were some point g which lay to the future of every point ofA. Then the 
future-directed curve A would enter and remain within the compact 
set J*(r) J~(q) for any 7 € A which would be impossible by proposition 
6.4.7 as the strong causality condition holds on ./. 

Now consider the function f(p) defined on.” by f(p) = f-(p)/f *(p). 
Any surface of constant f will be an acausal set and, by proposition 
6.3.1, will be a three-dimensional C!~ manifold imbedded in. {”. It will 
also be a Cauchy surface for.” since along any non-spacelike curve, 
f~ will tend to zero in the past and f+ will tend to zero in the future. 
One can put a timelike vector field V on. {° and define a continuous 
map # which takes points of./~ along the integral curves of V to where 
they intersect the surface ¥ (f = 1). Then (logf(p). B(p)) is a homeo- 
morphism of 4 onto Rk x /. If one smoothed f as in proposition 6.4.9, 
one could improve this to a diffeomorphism. Oo 


Thus if the whole of space-time were globally hyperbolic. i.e. if there 
were a global Cauchy surface, its topology would be very dall, 
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6.7 The existence of geodesics 


The importance of global hyperbolicity for chapter 8 lies in the 
following result: 


Proposition 6.7.1 

Let p and qlie ina globally hyperbolic set.V with qe J *(p). Then there 
is a non-spacelike geodesic from p to g whose length is greater than or 
equal to that of any other non-spacelike curve from p to q. 


Almost broken almost null 
curve from p to gin Y 


Ficure 46. Y is an open neighbourhood of the timelike curve A from p to q. 
There exist in Y timelike curves from p to g which approximate broken null 
curves and are of arbitrarily small length. 


We shall present two proofs of this result: the first, due to Avez (1963) 
and Seifert (1967), isan argument from the compactness of C(p, q), and 
the second (applicable only when ./ is open) is a procedure whereby 
the actual geodesic is constructed. 

The space C(p,q) contains a dense subset C’(p,q) consisting of all 
the timelike C! curves from p to g. The length of one of these curves A is 
defined (cf. § 4.5) as 


ap = | . (—g(d/at, a/at))* at, 


where ¢ is a C! parameter on A. The function L is not continuous on 
C'(p,q) since any neighbourhood of A contains a zig-zag piecewise 
almost null curve of arbitrarily small length (figure 46). This lack of 
continuity arises because we have used the C° topology which says that 
two curves are close if their points in -@, but not necessarily their 
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tangent vectors, are close. We could put a C! topology on C’(p, q) and 
so make L continuous but we do not do this because C’(p, q) is not 
compact; one gets a compact space only when one includes all the 
continuous non-spacelike curves. Instead, we use the C° topology and 
extend the definition of L to C(p,q). 

Because of the signature of the metric, putting wiggles in a timelike 
curve reduces its length. Thus Z is not lower semi-continuous. However 
one has: 


Lemma 6.7.2 
Lis upper semi-continuous in the C® topology on C’(p, q). 


Consider a C1 timelike curve A(t) from p to g, where the parameter ¢ is 
chosen to be the arc-length from p. In a sufficiently small neighbour- 
hood Y of A, one can find a function f which is equal to ¢t on A and is 
such that the surfaces {f = constant} are spacelike and orthogonal to 
6/dt (i.e. g@f.,|, = (2/ét)*). One way to define such an f would be to 
construct the spacelike geodesics orthogonal to A. For a sufficiently 
small neighbourhood % of A, they will give a unique mapping of Y to A, 
and the value of f at a point in Y can be defined as the value of t at the 
point on A into which it is mapped. Any curve y in Y can be para- 
metrized by f. The tangent vector (@/éf), to ~ can be expressed as 


(5),) gh geek, 


where k is a spacelike vector lying in the surface {f = constant}, i.e. 
kf, , = 0. Then 


((a) (5) ) = 9" Ft ads0+ Janko 


é he 


2 ad GUre: 
However on A, gf. f,, = — 1. Thus given any € > 0, one can choose 
U’ = U sufficiently small that on @’, gf. f., > —1+.¢. Therefore for 
any curve # in &’, Lp] < (146) 201. a 


We now define the length of a continuous non-spacelike curve A from 
p to q as follows: let YW be a neighbourhood of A in. # and let (WZ) be 
the least upper bound of the lengths of timelike curves in Y from Pp 
to q. Then we define L[A] as the greatest lower bound of I(%) for all 
neighbourhoods ¥ of A in. #. This definition of length will work for all 
curves A from p tog which havea (! timelike curve in every neighbour- 
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hood, i.e. it will work for all points in C(p, q) which lie in the closure of 
C'(p,q). By § 4.5, a non-spacelike curve from p to q which is not an 
unbroken null geodesic curve can be varied to give a piecewise C1 
timelike curve from p to q, and the corners of this curve can be 
rounded off to give a C! timelike curve from p to g. Thus points in 
C(p. q) — C'(p, q) are unbroken null geodesics (containing no conjugate 
points), and we define their length to be zero. 

This definition of Z makes it an upper semi-continuous function on 
the compact space C"(p,q). (Actually, as a continuous non-spacelike 
curve satisfies a local Lipschitz condition, it is differentiable almost 
everywhere. Thus the length could still be defined as 


| (-a¢elet, afm, 


and this would agree with the definition above.) If C’(p,q) is empty 
but C(p.q) is non-empty, p and gq are joined by an unbroken null 
geodesic and there are no non-spacelike curves from p to g which are 
not unbroken null geodesics. If C’(p, ¢) is non-empty, it will contain 
some point at which ZL attains its maximum value, i.e. there will be 
a non-spacelike curve y from p to g whose length is greater than or 
equal to that of any other such curve. By proposition 4.5.3, y must be 
a geodesic curve as otherwise one could find points x, yey which lay 
in a convex normal coordinate neighbourhood and which could be 
joined by a geodesic segment of greater length than the portion of y 
between x and y. Oo 


For the other. constructive, proof, we first define d(p,q) for p,ge-4 
to be zero if ¢¢.J *(p) and otherwise to be the least upper bound of the 
lengths of future-directed piecewise non-spacelike curves from p to q. 
(Note that d(p,q) may be infinite.) For sets Y and Y, we define 
d(S, U%) to be the least upper bound of d(p,q), peY, ¢ €%. 

Suppose ge1+(p) and that d(p,q) is finite. Then for any 6 > 0 one 
can find a timelike curve A of length d(p,q)— 40 from p to g and a 
neighbourhood Y of ¢g such that A can be deformed to give a timelike 
curve of length d(p, q) — 6 from p to any point re Y%. Thus d(p, q), where 
finite, is lower semi-continuous. In general d(p,q) is not upper semi- 
continuous but: 


Lemma 6.7.3 
d(p,q) is finite and continuous in p and g when p and q are contained 


in a globally hyperbolic set . 
8-2 
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We shall first prove d(p, q) is finite. Since strong causality holds on the 
compact set J+(p)n J~(q), one can cover it with a finite number of 
local causality sets such that each set contains no non-spacelike curve 
longer than some bound e. Since any non-spacelike curve from p to g 
can enter each neighbourhood at most once, it must have finite length. 

Now suppose that for p,ge.V, there is a 6 > 0 such that every 
neighbourhood of g contains a point reV such that 


d(p,r) > d(p,q) +0. 


Let x, be an infinite sequence of points in. VW converging to q such that 
d(p,x,) > d(p,q) +6. Then from each x, one can find a non-spacelike 
curve A,, to p of length > d(p,q)+6. By lemma 6.2.1 there will be 
a past-directed non-spacelike curve A through g which is a limit curve 
of the A,,. Let Y be a local causality neighbourhood of g. Then A cannot 
intersect [~(q)n W since if it did one of the A,, could be deformed to 
give a non-spacelike curve from p to qg of length > d(p,q). ThusAn YW 
must be a null geodesic from g and at each point x of An Y. d(p, x) will 
have a discontinuity greater than 6. This ‘argument can be repeated 
to show that A is a null geodesic and at each point x€A, d(p,x) has 
a discontinuity greater than 6. This shows that A cannot have an end- 
point at p, since by proposition 4.5.3, d(p,x) is continuous on a local 
causality neighbourhood of p. On the other hand, A would hg inextend- 
ible in .# —p and so if it did not have an endpoint at p, it would have 
to leave the compact set J+(p) n J~(q) by proposition 6.4.7. This shows 
that d(p,q) is upper semi-continuous on. O 


In the case that .4” is open, one can easily construct the geodesic of 
maximum length from p to g by using the distance function. Let 
U— N bea local causality neighbourhood of p which does not contain 
q and let xe J*(p)n J~(q) be such that d(p,r)+d(r,q), re®%, is maxi- 
mized for r = x. Construct the future-directed geodesic y from p 
through x. The relation d(p, r)+d(r,q) = d(p,q) will hold for all points 
ron y between p and x. Suppose there were a point y €J~(¢) —¢ which 
was the last point on y at which this relation held. Let ” <.W be 
a local causality neighbourhood of y which does not contain g and 
let zeJ t(y)n J-(q)n W be such that d(y,r)+d(r,q). re¥ , attains its 
maximum value d(y, g) for r = z. If z did not lie on y, then 


d(p,z) > d(p,y)+d(y,z) and d(p,z)+d(z,q) > d(p,q) 
which is impossible. This shows that the relation 


d(p,r)+d(r,q) = d(p,¢) 
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must hold for all reyn J~(q). As J+(p)n J~(q) is compact, y must 
leave J~(q) at some point y. Suppose y + q; then y would lie on a 
past-directed null geodesic A from q. Joining y to A would give a non- 
spacelike curve from p to gq which could be varied to give a curve 
longer than d(p,q), which is impossible. Thus y is a geodesic curve 
from p to q of length d(p,q). Oo 


Corollary 

If F is a C? partial Cauchy surface, then to each point ge D+(S) 
there is a future-directed timelike geodesic curve orthogonal to ¥ of 
length d(%,q), which does not contain any point conjugate to # 
between ¥ and q. 


By proposition 6.5.2, H+(.S) and H~(F) do not intersect FY and so are 
not in D(/). Thus D(S) = int D() is globally hyperbolic by proposi- 
tion 6.6.3. By proposition 6.6.6, / 1 J~(q) is compact and so d(p, q), 
peF, will attain its maximum value of d(.Y,q) at some point re. 
There will be a geodesic curve y from 7 to q of length d(Y, q) which by 
lemma 4.5.5 and proposition 4.5.9 must be orthogonal to Y and not 
contain a point conjugate to Y between ¥ and q. 0 


6.8 The causal boundary of space-time 


In this section we shall give a brief outline of the method of Geroch, 
Kronheimer and Penrose (1972) for attaching a boundary to space— 
time. The construction depends only on the causal structure of (4, g). 
This means that it does not distinguish between boundary points at a 
finite distance (singular points) and boundary points at infinity. In 
§ 8.3 weshall describe a different construction which attaches a bound- 
ary which represents only singular points. Unfortunately there does 
not seem to be any obvious relation between the two constructions. 

We shall assume that (4, §) satisfies the strong causality condition. 
Then any point p in (-4, g) is uniquely determined by its chronological 
past I-(p) or its future I*(p), i.e. 


a) a) — 1g) p49. 
The chronological past W =I-(p) of any point pe.# has the 
properties: 


(1) W is open; 
(2) W isa past set,ie. I-(W) <W; 
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(3) W cannot be expressed as the union of two proper subsets 

which have properties (1) and (2). 
We shall call a set with properties (1), (2) and (3) an indecomposable 
past set, abbreviated as IP. (The definition given by Geroch, Kron- 
heimer and Penrose does not include property (1). However it is 
equivalent to the definition given here, since by ‘a past set’ they mean 
a set which equals its chronological past, rather than merely con- 
taining it.) One can define an IF, or indecomposable future set, similarly. 

One can divide IPs into two classes: proper [Ps (PIPs) which are 
the pasts of points in -@, and terminal IPs (TIPs) which are not the 
past of any point in -4. The idea is to regard these TIPs and the 
similarly defined TIF s as representing points of the causal boundary 
(c-boundary) of (-#, 8). For instance, in Minkowski space one would 
regard the shaded region in figure 47 (i) as representing the point p 
on “%*. Note that in this example, the whole of -@ is itself a TIP and 
also a TIF’. These can be thought of as representing the points 7+ and i- 
respectively. In fact all the points of the conformal boundary of 
Minkowski space, except 2°, can be represented as TIPs or TIFs. In 
some cases, such as anti-de Sitter space, where the conformal boundary 
is timelike, points of the boundary will be represented by both a TIP 
and a TIF (see figure 47 (ii)). 

One can also characterize TIPs as the pasts of future-inextendible 
timelike curves. This means that one can regard the past I-(y) of 
a future-inextendible curve y as representing the future endpoint of y 
on the c-boundary. Another curve y’ has the same endpoint if and 
only thi=(y) = ay) 


Proposition 6.8.1 (Geroch, Kronheimer and Penrose) 


A set W is a TIP if and only if there is a future-inextendible timelike 
curve y such that I-(y) = W. 


Suppose first that there is a curve y such that I-(y)=W. Let 
W=UUV where UW and ¥ are open past sets. One wants to show 
that either Y is contained in Y, or VY contained in ZY. Suppose that, 
on the contrary, Y is not contained in ¥ and W not contained in UM 
Then one could find a point g in YW—¥Y and a point rinW —Y. Now 
q,r€I~(y), so there would be points q’,r’ ey such that qel-(q’) and 
reI-(r’). But whichever of Y or Y contained the futuremost of Cat 
would also contain both g and 7, which contradicts the original defini- 
tions of g and r. 
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TIF representing 
point p 


TIP representing 
point p 


(11) 


Figure 47. Penrose diagrams of Minkowski space and anti-de Sitter space (cf. 
figures 15 and 20), showing (i) the TIP representing a point p on.%*+ in Minkowski 
space, and (ii) the TIP and the TIF representing a point p on ¥ in anti-de Sitter 
space. 


Conversely, suppose W is a TIP. Then one must construct a time- 
like curve y such that W = I-(y). Now if p is any point of W, then 
W =I-W 0 I*(p)) UI-(W —I*(p)). However W is indecomposable, 
so either W = I-(Wn I*(p)) or W = 1-(W —I*(p)). The point p is 
not contained in 1-(W — I*(p)), so the second possibility is eliminated. 
The conclusion may be restated in the following form: given any pair 
of points of WY, then W contains a point to the future of both of them. 
Now choose a countable dense family p,, of points of W’. Choose a point 
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dy in W to the future of p,. Since gj and p, are in W, one can choose 
a point g,in W to the future of both of them. Since g, and p, arein WY, 
one can choose q, in W to the future of both of them, and so on. Since 
each point q, obtained in this way lies in the past of its successor, one 
can find a timelike curve yin W through all the points of the sequence. 
Now for each point peW, the set W n J*(p) is open and non-empty, 
and so it must contain at least one of the p,,, since these are dense. 
But for each k, p, lies in the past of g,, whence p itself lies in the past 
of y. This shows that every point of W lies to the past of y, and so 
since y is contained in the open past set W, one must have 
t= lay). oO 


We shall denote by AM the set of all IPs of the space (.@, 8). Then M&M 
represents the points of plus a future c-boundary; similarly, .4, the 
set of all IF's of (-4, 8), represents @ plus a past c-boundary. One can 
extend the causal relations I, J and E to .@ and -@ in the following 
way. For each Y, V < &, we shall say 


Uel-V,M) ££ UCY, 
Ucl-(V,M) if UCI-(q) for some point geV, 
Uch-(V,M) if UeI-V,M) butnot Wel-(V,.A). 


With these relations, the IP-space AM is a causal space (Rronheimer 
and Penrose (1967)). There is a natural injective map [-: #@> M 
which sends the point pe.# into I-(p)e4. This map is an iso- 
morphism of the causality relation J- as peJ-(q) if and only if 
L-(p)eJ-I-(q), My. The causality relation is preserved by J- but not 
by its inverse, i.e. pel-(q) > I-(p)el-(I-(q), ). One can define 
causal relations on .@ similarly. 

The idea now is to write .# and in some way to form a space @* 
which has the form .-@ U A where A will be called the c-boundary of 
(4,8). To do so, one needs a method of identifying appropriate IPs 
and IF's. One starts by forming the space .W# which is the union of 
M and .#, with each PIF identified with the corresponding PIP. In 
other words, .@# corresponds to the points of # together with the 
TIPs and TIFs. However as the example of anti-de Sitter space shows, 
one also wants to identify some TIPs with some TIFs. One way of 
doing this is to define a topology on .@*, and then to identify some 
points of .@* to make this topology Hausdorff. 

As was mentioned in § 6.4, a basis for the topology of the topological 
space -@ is provided by sets of the form J *(p) 0 I-(q). Unfortunately 
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one cannot use a similar method to define a basis for the topology of 
M* as there may be some points of .@# which are not in the chrono- 
logical past of any points of .W#. However one can also obtain a 
topology of -@ from a sub-basis consisting of sets of the form I+(p), 

1-@), Be Te *(p) and @— iz (p). Following this analogy, Geroch, 

Kromheimer and Penrose have shown how one can define a topology 
on .##. For an IF ./e€.M, one defines the sets 


SH=VA:V EM maV I A + St, 
and Ot Sy: Vem maVv =I- (W)> TW) € H}. 


For an IP Ze.M, the sets Z™ and Best are defined similarly. The 
open sets of .W* are then defined to be the unions and finite inter- 
sections of sets of the form ./™t, ofext, Bint and Fert. The sets .V7™ and 
ZB are the analogues in-@* of the sets [+(p) and J~(q). If in particular 
Af = I*(p) and VW = I-~(q) then W € s¥™ if and only if ge I*(p). 
However the definitions enable one also to incorporate TIPS into 
Jint| The sets o/*t and Zest are the analogues of M—I*(p) and 
&M —I-(q). 

Finally one obtains &* by identifying the smallest number of points 
in the space @#* necessary to make it a Hausdorff space. More precisely 
M* is the quotient space .4*/R, where RF, is the intersection of all 
equivalence relations R < @*x.4#* for which -@#/R is Hausdorff. 
The space .@* has a topology induced from .@*# which agrees with the 
topology of -# on the subset -@ of -@*. In general one cannot extend 
the differentiable structure of @ to A, though one can on part of A in 
a special case which will be described in the next section. 


6.9 Asymptotically simple spaces 


In order to study bounded physical systems such as stars, one wants 
to investigate spaces which are asymptotically flat, i.e. whose metrics 
approach that of Minkowski space at large distances from the system. 
The Schwarzschild, Reissner-Nordstrom and Kerr solutions are 
examples of spaces which have asymptotically flat regions. As we saw 
in chapter 5, the conformal structure of null infinity in these spaces is 
similar to that of Minkowski space. This led Penrose (1964, 19656, 
1968) to adopt this as a definition of a kind of asymptotic flatness. We 
shall only consider strongly causal spaces. Penrose does not make the 
requirement of strong causality. However it simplifies matters and im- 
plies no loss of generality in the kind of situation we wish to consider. 
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A time-and space-orientable space (-@, 8) is said to be asymptotically 
simple if there exists a strongly causal space (.@, &) and an imbedding 
6: &-> M which imbeds -M as a manifold with smooth boundary 04 
in &@, such that: » 

(1) there is a smooth (say C? at least) function Q on @ such that on 
6(.M), Qis positive and Q2g = 0,(§%) (i.e. & is conformal to g on 6(-4)); 

(2) on 0.4, OQ = Oand dQ + 0; 

(3) every null geodesic in has two endpoints on 24. 

We shall write # U aM = UM. 

In fact this definition is rather more general than one wants since 
it includes cosmological models, such as de Sitter space. In order to 
restrict it to spaces which are asymptotically flat spaces, we will say 
that a space (.W, 8) is asymptotically empty and simple if it satisfies 
conditions (1), (2), and (3), and 

(4) R,,, = 0 on an open neighbourhood of @ in Z. (This condition 
can be modified to allow the existence of electromagnetic radiation 
near 0.@). ’ 

The boundary 0.4 can be thought of as being at infinity, in the 
sense that any affine parameter in the metric g on a null geodesic in -# 
attains unboundedly large values near 0.M. This is because an affine 
parameter v in the metric g is related to an affine parameter # in the 
metric & by dv/dé = Q-?. Since Q = Oat 24, [dv diverges? 

From conditions (2) and (4) it follows that the boundary 2 is a 
null hypersurface. This is because the Ricci tensor #,, of the metric 
Jay is related to the Ricci tensor R,, of g,, by 

ie a OR? - 207"(Q) ae G”” oe { ~ 07 Og + 302-0 c Oak gts,” 
where | denotes covariant differentiation with respect to g,,. Thus 
R= O?°R-6070,,494 + 3070,,0,49%4. 
Since the metric j,, is C3, R is C! at @W where Q = 0. This implies 
that Q,,Q.,9°% = 0. However by condition (2), Q,, + 0. Thus Q,,9° is 
a null vector, and the surface ¢.W (Q = 0) is a null hypersurface. 

In the case of Minkowski space, 0. consists of the two null surfaces 
F* and F—, each of which has the topology R! x S?. (Note that it does 
not include the points 7°, 7+ and i~ since the conformal boundary is not 
a smooth manifold at these points.) We shall show that in fact 0.W has 
this structure for any asymptotically simple and empty space. 

Since é.@ is a null surface, .# lies locally to the past or future of it. 
This shows that @M must consist of two disconnected components: 
F* on which null geodesics in -W have their future endpoints, and .4- 
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on which they have their past endpoints. There cannot be more than 
two components of 0M, since there would then be some point pe# 
for which some future-directed null geodesics would go to one com- 
ponent and others to another component. The set of null directions 
at p going to each component would be open, which is impossible, 
since the set of future null directions at p is connected. 

We next establish an important property. 


Lemma 6.9.1 
An asymptotically simple and empty space (./, g) is causally simple. 


Let # be a compact set of . One wants to show that every null 
geodesic generator of J+(W) has past endpoint at W. Suppose there 
were a generator that did not have endpoint there. Then it could not 
have any endpoint in .4, so it would intersect .4-, which is im- 
possible. O 


Proposition 6.9.2 
An asymptotically simple and empty space (4,8) is globally 
hyperbolic. 


The proof is similar to that of proposition 6.6.7. One puts a volume 
element on.@ such that the total volume of 4 in this measure is unity. 
Since (.%, 8) is causally simple, the functions f+(p), f-(p) which are 
the volumes of [+(p), 7-(p) are continuous on -@. Since strong causality 
holds on .4, f*(p) will decrease along every future-directed non- 
spacelike curve. Let A be a future-inextendible timelike curve. Sup- 
pose that ¥ = (\ I*(p) was non-empty. Then F would be a future set 


and the null semen of the boundary of ¥ in-@ would have no past 
endpoint in &. Thus they would intersect .%~, which again leads to 
a contradiction. This shows that f+(p) goes to zero as p tends to the 
future on A. From this it follows that every inextendible non-spacelike 
curve intersects the surface % = {p: f+(p) = f-(p)}, which is therefore 
a Cauchy surface for -@. Oo 


Lemma 6.9.3 

Let W be a compact set of an asymptotically empty and simple space 
(@, 8). Then every null geodesic generator of 4* intersects J +(W,M) 
once, where * indicates the boundary in .@. 

Let peA, where A is a null geodesic generator of +. Then the past set 
(in .@) J-(p, W@)0 M must be closed in -#, since every null geodesic 
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generator of its boundary must have future endpoint on Yt at p. 
Since strong causality holds on dM, M —JI~(p, M) will be non-empty. 
Now suppose that A were contained in J*(W, Md). Then the past set 
1 (J-(p, @)n M) would be non-empty. This would be impossible, 
pea 


since the null generators of the boundary of the set would intersect .#*. 

Suppose on the other hand that A did not intersect J1(W, 4M). Then 

M —\J (J-(p, #)n M) would be non-empty. This would again lead 
a 


pe 
to a contradiction, as the generators of the boundary of the past set 
es (J-(p, M@) M) would intersect 4+. D 


Corollary 

J+ is topologically R! x (J+(W, M)n 0M). 

We shall now show that .%*+ (and .%-) and @ are the same topologically 
as they are for Minkowski space. 


Proposition 6.9.4 (Geroch (1971)) 

In an asymptotically simple and empty space (.W, 8), %* and ¥%~ are 
topologically A! x S?, and @ is Ft. 

Consider the set N of all null geodesics in .@. Since these all intersect 
the Cauchy surface #, one can define local coordinates on by the 
local coordinates and directions of their intersections with #. This 
makes N into a fibre bundle of directions over # with fibre S?. How- 
ever every null geodesic also intersects “+. Thus WN is also a fibre 
bundle over .4+. In this case, the fibre is S? minus one point which 
corresponds to the null geodesic generator of .%+ which does not enter 
-#. In other words, the fibre is R?. Therefore N is topologically 
I+ x R2, However “+ is Rx (J+(W, HM) &M). This is consistent 
with N = # x8? only if H ~ R* and 4+ = Rix S?. oO 


Penrose (19656) has shown that this result implies that the Weyl 
tensor of the metric § vanishes on .7+ and ¥—-. This can be interpreted 
as saying that the various components of the Weyl tensor of the 
metric § ‘peel off’, that is, they go as different powers of the affine 
parameter on a null geodesic near + or .4—-. Further Penrose (1963), 
Newman and Penrose (1968) have given conservation laws for the 
energy-momentum as measured from .¥*, in terms of integrals on +. 

The null surfaces .~+ and .£~ form nearly all the c-boundary A of 
(.@, 8) defined in the previous section. To see this, note first that any 
point pe.%+ defines a TIP I-(p,.@)n mM. Suppose A is a future- 
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inextendible curve in -@. If A has a future endpoint at pe.4*, then the 
TIP J-(A) is the same as the TIP defined by p. If A does not have a 
future endpoint on .7+, then # — I~(A) must be empty, since if it were 
not, the null geodesic generators of /-(A) would intersect .4+ which is 
impossible as A does not intersect “+. The TIPs therefore consist of 
one for each point of .4+, and one extra TIP, denoted by 2+, which is 
AH itself. Similarly, the TIFs consist of one for each point of .4-, and 
one, denoted by 7-, which again is -@ itself. 

One now wants to verify that one does not have to identify any 
TIPs or TIFs, i.e. that /* is Hausdorff. It is clear that no two TIPs 
or TI's corresponding to .7+ or .¥~- are non-Hausdorff separated. If 
pes* then one can find ge.W such that p¢I*(q,.Z). Then 
(I+(q, @))*** is a neighbourhood in.# of the TIP I~(p,.7)n -M, and 
(I+(q, #@))™* is a disjoint neighbourhood of the TIP i+. Thus i+ is 
Hausdorff separated from every point of %*. Similarly it is Hausdorff 
separated from every point of .4-. Thus the c-boundary of any 
asymptotically simple and empty space (.@, §) is the same as that of 
Minkowski space-time, consisting of 4+, %- and the two points 7+, i-. 

Asymptotically simple and empty spaces include Minkowski space 
and the asymptotically flat spaces containing bounded objects such as 
stars which do not undergo gravitational collapse. However they do 
not include the Schwarzschild, Reissner—Nordstr6m or Kerr solutions, 
because in these spaces there are null geodesics which do not have 
endpoints on “> or £—. Nevertheless these spaces do have asympto- 
tically flat regions which are similar to those of asymptotically empty 
and simple spaces. This suggests that one should define a space (4, g) 
to be weakly asymptotically simple and empty if there is an asymp- 
totically simple and empty space (.@’, 8’) and a neighbourhood @’ of 
0M! in M' such that Yn @’ is isometric to an open set Y of 4. This 
definition covers all the spaces mentioned above. In the Reissner— 
Nordstrém and Kerr solutions there is an infinite sequence of asymp- 
totically flat regions Y which are isometric to neighbourhoods &’ of 
asymptotically simple spaces. There is thus an infinite sequence of 
null infinities %*+ and %—. However we shall consider only one asymp- 
totically flat region in these spaces. One can then regard (4,8) as 
being conformally imbedded in a space (4, &) such that a neighbour- 
hood Y of é.4 in M@ is isometric to Y’. The boundary é.@ consists of 
a single pair of null surfaces .%* and #~. 

We shall discuss weakly asymptotically simple and empty spaces 
in §9.2 and § 9.3. 


7 
The Cauchy problem in General Relativity 


In this chapter we shall give an outline of the Cauchy problem in 
General Relativity. We shall show that, given certain data on a space- 
like three-surface Y, there is a unique maximal future Cauchy 
development Dt(/) and that the metric on a subset Y of Dt(S) 
depends only on the initial data on J-(Y)n #. We shall also show 
that this dependence is continuous if Y hasa compact closure in D*(S). 
This discussion is included here because of its intrinsic interest, 
because it uses some of the results of the previous chapter, and because 
it demonstrates that the Einstein field equations do indeed satisfy 
postulate (a) of §3.2 that signals can only be sent between points that 
can be joined by a non-spacelike curve. However it is not really needed 
for the remaining three chapters, and so could be skipped by the 
reader more interested in singularities. 

In §7.1, we discuss the various difficulties and give a precise formu- 
lation of the problem. In §7.2 we introduce a global background 
metric & to generalize the relation which holds between the Ricci 
tensor and the metric in each coordinate patch to a single relation 
which holds over the whole manifold. We impose four gauge conditions 
on the covariant derivatives of the physical metric g with respect to 
the background metric 8. These remove the four degrees of freedom 
to make diffeomorphisms of a solution of Einstein’s equations, and 
lead to the second order hyperbolic reduced Einstein equations for g 
in the background metric 8. Because of the conservation equations, 
these gauge conditions hold at all times if they and their first deriva- 
tives hold initially. 

In §7.3 we show that the essential part of the initial data for § on 
the three-dimensional manifold Y can be expressed as two three- 
dimensional tensor fields h®, y*® on Y. The three-dimensional mani- 
fold / is then imbedded in a four-dimensional manifold . and a 
metric g is defined on Y such that 47 and y* become respectively the 
first and second fundamental forms of Y in §. This can be done in such 
a way that the gauge conditions hold on Y. In § 7.4 we establish some 

[ 226 ] 
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basic inequalities for second order hyperbolic equations. These relate 
integrals of squared derivatives of solutions of such equations to their 
initial values. These inequalities are used to prove the existence and 
uniqueness of solutions of second order hyperbolic equations. In §7.5 
the existence and uniqueness of solutions of the reduced empty space 
Einstein equations is proved for small perturbations of an empty space 
solution. The local existence and uniqueness of empty space solutions 
for arbitrary initial data is then proved by dividing the initial surface 
up mto small regions which are nearly flat, and then joining the 
resulting solutions together. In §7.6 we show there is a unique maximal 
empty space solution for given initial data and that in a certain sense 
this solution depends continuously on the initial data. Finally in §7.7 we 
indicate how these results may be extended to solutions with matter. 


7.1 The nature of the problem 


The Cauchy problem for the gravitational field differs in several 
important respects from that for other physical fields. 

(1) The Einstein equations are non-linear. Actually in this respect 
they are not so different from other fields, for while the electromagnetic 
field, the scalar field, etc., by themselves obey linear equations in a given 
space-time, they form a non-linear system when their mutual inter- 
actions are taken into account. The distinctive feature of the gravita- 
tional field is that it is self-interacting: it is non-linear even in the 
absence of other fields. This is because it defines the space-time over 
which it propagates. To obtain a solution of the non-linear equations 
one employs an iterative method on approximate linear equations 
whose solutions are shown to converge in a certain neighbourhood of 
the initial surface. 

(2) Two metrics §, and g,0na manifold 4 are physically equivalent 
if there is a diffeomorphism ¢:.W@—>-W which takes §, into §>» 
($481 = &2), and clearly g, satisfies the field equations if and only if 8, 
does. Thus the solutions of the field equations can be unique only up to 
a diffeomorphism. In order to obtain a definite member of the equiva- 
lence class of metrics which represents a space-time, one introduces 
a fixed ‘background’ metric and imposes four ‘gauge conditions’ on 
the covariant derivatives of the physical metric with respect to the 
background metric. These conditions remove the four degrees of 
freedom to make diffeomorphisms and lead to a unique solution for 
the metric components. They are analogous to the Lorentz condition 
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which is imposed to remove the gauge freedom for the electromagnetic 
field. 

(3) Since the metric defines the space-time structure, one does not 
know in advance what the domain of dependence of the initial surface 
is and hence what the region is on which the solution is to be deter- 
mined. One is simply given a three-dimensional manifold / with 
certain initial data w on it, and is required to find a four-dimensional 
manifold .#, an imbedding 0: S>.M and a metric § on -@ which 
satisfies the Einstein equations, agrees with the initial values on 6(.7) 
and is such that 6(Y) is a Cauchy surface for @. We shall say that 
(4,0, &), or simply 4, is a development of (/, w). Another develop- 
ment (.4', 0’, 8’) of (FY, w) will be called an extension of -@ if there is 
a diffeomorphism « of “ into -#’ which leaves the image of Y point- 
wise fixed and takes g’ into § (i.e. 0-10-10’ = idon ¥ and a, §’ = $). 
We shall show that provided the initial data w satisfies certain 
constraint equations on F, there will exist developments of (Y, w) and 
further, there will be a development which is maximal in the sense 
that it is an extension of any development of (.Y,w). Note that by 
formulating the Cauchy problem in these terms we have included the 
freedom to make diffeomorphisms, since any development is an 
extension of any diffeomorphism of itself which leaves the image of Y 
pointwise fixed. . 


7-2 The reduced Einstein equations 


In chapter 2, the Ricci tensor was obtained in terms of coordinate 
partial derivatives of the components of the metric tensor. For the 
purposes of this chapter it will be convenient to obtain an expression 
that applies to the whole manifold -W and not just to each coordinate 
neighbourhood separately. To this end we introduce a background 
metric 8 as well as the physical metric g. With two metrics one has to 
be careful to maintain the distinction between covariant and contra- 
variant indices. (To avoid confusion, we shall suspend the usual con- 
ventions for raising and lowering indices.) The covariant and contra- 
variant forms of g and @ are related by 


9°40 = 3. = OF : Gey 


It will be convenient to take the contravariant form g® of the metric 
to be more fundamental and the covariant form Jap aS derived from it 
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by (7.1). Using the alternating tensor 9,,,4 defined by the background 
metric, this relation can be expressed explicitly as 


1 . , 
Jap = 319 9°79" (det 8) Faces Pears» (7.2) 
1 Pa 
where (det g)71 = ra 970g 4997 cet Nears 


is the determinant of the components of g” in a basis which is ortho- 
normal with respect to the metric g. 

The difference between the connection T defined by g and the con- 
nection I defined by § is a tensor, and can be expressed in terms of the 
covariant derivative of g with respect to T' (cf §3.3): 


na = 14,4. 
= 4974 (902 9039 — 94: 9%. 8%; — Ges 8%, 8%), (7.3) 
where we have used a stroke to denote covariant differentiation with 


respect to f and the symbol 6 to denote the difference between 
quantities defined from g and §. Then from (2.20), 


Ol Ol — Oe cap FOL gp OU as — 0! “ae O! “na: (7.4) 
Thus 


6(RY — kg®R) = g%gridR,, + 2dg%ag Hit, — dgt*bgri Rn, 
— fdg™R — 4g%(dg' R.,; +g 6R;;) 
= hytidg™ 5 — Gey, + 49 (Wis — Fea 2093) 


+(terms in 6g°%,, and dg‘), (7.5) 
yr? = 9-49 Gaegc = (det )-1 ((det 8) 9”), = (det g)16"",,_ (7.6) 
and ge = (det 8) dg”. 


The plan is now as follows. We choose some suitable background 
metric & and express the Einstein equations in the form 


R%® —1Rg® = 5(R®—4 Rg”) + Re —19%R = 87Jad. (77) 


One regards this as a second order non-linear set of differential equa- 
tions to determine ¢ in terms of the values of it and its first derivatives 
on some initial surface. Of course to complete the system one has to 
specify the equations governing the physical fields which make up the 
energy-momentum tensor 7’. However even when this is done one 
does not have a system of equations which uniquely determines the 
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time development in terms of the initial values and first derivatives. 
The reason for this is, as was mentioned above, that a solution of the 
Einstein equations can be unique only up to a diffeomorphism. In 
order to obtain a definite solution one removes this freedom to make 
diffeomorphisms by imposing four gauge conditions on the covariant 
derivatives of § with respect to the background metric g. We shall use 
the so-called ‘harmonic’ conditions 


y? I o, =0 
which are analogous to the Lorentz gauge conditions A*,; = 0 in 


electrodynamics. With this condition one obtains the reduced Einstein 
equations 


g'ig@.,+ (terms in d%, and 6%) = 1677 — 2Reb+ GR, (7.8) 


We shall denote the left-hand side of (7.8) by H®,2(6°%), where H@°., is 
the Einstein operator. For suitable forms of the energy-momentum 
tensor 7’ these are second order hyperbalic equations for which we 
shall demonstrate the existence and uniqueness of solutions in §7.5. 
We still have to check that the harmonic conditions are consistent 
with the Einstein equations. That is to say: we derived (7.8) from the 
Einstein equations by assuming that 6°, was zero. We now have to 
verify that the solution that (7.8) gives rise to does indeed have this 
property. To do this, differentiate (7.8) and contract. This gives an 
equation of the form 


GY gt Blt YW, +02 yr = 1677. ,, (7.9) 


where a semi-colon denotes differentiation with respect to g, and the 
tensors B,0* and C. depend on 9@, R,.4, 9? and g@ >. Equations (7.9) 
may be regarded as second order linear hyperbolic equations for 7°. 
Since the right-hand side vanishes, one can use the uniqueness 
theorem for such equations (proposition 7.4.5) to show that y will 
vanish everywhere if it and its first derivatives are zero on the initial 
surface. We shall see in the next section that this can be arranged by 
a suitable diffeomorphism. 

We still have to show that the unique solution obtained by imposing 
the harmonic gauge condition is related by a diffeomorphism to any 
other solution of the Einstein equations with the same initial data. 
This will be done in §7.4 by making a special choice of the background 
metric. 
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7.3 The initial data 


As (7.8) is a second order hyperbolic system it seems that to determine 
the solution one should prescribe the values of g® and g,.uon the 
initial surface O(/), where wu’ is some vector field which is not tangent 
to O(). However not all these twenty components are significant or 
independent: some can be given arbitrary initial values without 
changing the solution by more than a diffeomorphism, and others have 
to obey certain consistency conditions. 

Consider a diffeomorphism :.#—-W which leaves (SY) pointwise 
fixed. This will induce a map ”, which takes g® at p¢O(S) into a new 
tensor “49° at p. Ifn,€T*,, is orthogonal to 4(.S) (i.e. n, V% = 0 for 
any V*eT, tangent to 6(S)) and normalized so that n,g%n, = —1 
then, by suitable choice of x. n, “9” can be made equal to any vector 
at p which is not tangent to 6(.Y). Thus the components n,g™ are not 
significant. On the other hand as yw leaves 6(.) pointwise fixed, the 
induced metrich,, = 6*g,,0n SF will remain unchanged. It is therefore 
only this part of § which lies in 6(.Y) which need be given to determine 
the solution. The other components 7, g% can be prescribed arbitrarily 
without changing the solution by more than a diffeomorphism. 
Another way of seeing this is to recall that we formulated the Cauchy 
problem in terms of certain data on a disembodied three-manifold S 
and then looked for an imbedding into some four-manifold .@. Now 
on Y itself one cannot define a four-dimensional tensor field like § but 
only a three-dimensional metric h, which we shall take to be positive 
definite. The contravariant and covariant forms of h are related by 


h®h,, = 0%, (7.10) 
where now 6%, is a three-dimensional tensor in Y. The imbedding 0 will 
carry h,, into a contravariant tensor field 0,4” on 6(S) which has 


the property 1, O xh” = 0. (7.11) 


As n,g@ is arbitrary, one may now define g on (7) by 
g@® = 0h? —uru?, (7.12) 
where uw“ is any vector field on 6() which is nowhere zero or tangent 
to O(Y). Defining g,, by (7.1), one has: 
Nhen =O Gan. Ng7@ =—n, uu, g yur =—1. (7.13) 
Thus h,,, is the metric induced on / by g and u* is the unit vector 
orthogonal to 6(/) in the metric g. 
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The situation with the first derivatives g?),u° is similar: Ne Grice® 
can be given any value by suitable diffeomorphisms. However there is 
now an additional complication in that g*’|, depends not only on g§ but 
also on the background metric § on -@. In order to give a description 
of the significant part of the first derivative of g in terms only of tensor 
fields defined on Y, we proceed as follows. We prescribe a symmetric 
contravariant tensor field y2? on Y. Under the imbedding ye is 
mapped into a tensor field 0,77 on OF). We require that this is 
equal to the second fundamental form (see § 2.7) of the submanifold 
6(S) in the metric g. This gives 


OX? = Oh” 8 h(UGer) sa 
= 0 h”O 4 h?*((WGec)ia— 81 ca UGes)- (7.14) 
Using (7.3), one has 


Bx = $0 4h bY — 96:94; 9 eu + Goi c+ Gei¥'in)- (7-15) 
This may be inverted to give g*,,u° in terms of 6, x: 
497." = — Cees 6h 6 PAG (ta) +uew?, (7.16) 


where W?® is some vector field on 6(.Y). It can be given any required 
value by a suitable diffeomorphism vy. 

The tensor fields A% and y*® cannot be prescribedscompletely 
independently on “. For multiplying the Einstein equations (7.7) by 
n one obtains four equations which do not contain g” ,guu%, the 
second derivatives of ¢ out of Y. Thus there must be four relations 
between g%, gue and n,T. Using (2.36) and (2.35), they can be 
expressed as equations in the three-manifold 7: 


Xa ce — XA ehca = 800* (Ty, U*), (il?) 
4( Ri + (x Rae)? — NOX Machya) = 876* (Tutu), (7.18) 


where a double stroke || denotes covariant differentiation in Y with 
respect to the metric h, and R’ is the curvature scalar of h. 

The data w on Y that is required to determine the solution therefore 
consists of the initial data for the matter fields (in the case of a scalar 
field ¢ for example, this would consist of two functions on ¥ repre- 
senting the value of ¢ and its normal derivative) and two tensor fields 
he and y® on ¥ which obey the constraint equations (7.17-18). These 
contraint equations are elliptic equations on the surface Y which 
impose four constraints on the twelve independent components of 
(A, v7"). In such situations, one can show one can prescribe eight of 
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these components independently and then solve the constraint equa- 
tions to find the other four, see e.g. Bruhat (1962). We shall call a pair 
(F, w) satisfying these conditions, an initial data set. We then imbed S 
in some suitable four-manifold .W with metric g and define g” on 
G(S) by (7.12) for some suitable choice of w*. We shall take wu? to be 
g@n,. Thus it will be the unit vector orthogonal to 6() in both the 
metric § and g. We shall also exploit our freedom of choice of W2 in 
the definition of g,,u° by (7.16) to make y zero on 6(.Y). This requires 


we = — 9" \4GeeO 4h + 49.4909 eh” 
te GeataNX —Git quel). (7.19) 


(Note that all the derivatives in (7.19) are tangent to 6() as is required 
by the fact that the fields involved have been defined only on 6(.Y).) To 
ensure that y® vanishes everywhere one also needs y®,,u° to be zero 
on 6(.S). However this now follows from the constraint equations 
providing the reduced Einstein equations (7.8) hold on 6(.7). One 
may therefore proceed to solve (7.8) as a second order non-linear 
hyperbolic system on the manifold 4 with metric &. 

(Note that there are 10 such equations for the ¢’s; in proving the 
existence of solutions of these 10 equations we do not split them into 
a set of constraint equations and a set of evolution equations, and so 
the question as to whether the constraint equations are conserved does 
not arise.) 


7.4 Second order hyperbolic equations 


In this section we shall reproduce some results on second order 
hyperbolic equations given in Dionne (1962). They will be generalized 
to apply to a whole manifold, not just one coordinate neighbourhood. 
These results will be used in the following sections to prove the exist- 
ence and uniqueness of developments for an initial data set (Y, w). 
We first introduce a number of definitions. We use Latin letters to 
denote multiple contravariant or covariant indices; thus a tensor of 
type (r, 8) will be written as K7,, and we denote by [I| = r the number 
of indices that the multiple index J represents. We introduce a positive 
definite metric ¢,, on -@ and define 
el = etbecd ., era, 


——— 
r times y times 


ery = Cap cd eos Cng> 
——_ ee 


234 THE CAUCHY PROBLEM [7.4 


where |J| = |J| =r. We then define the magnitude |K7,| (or simply, 
|K|) as (K!, K"y,e;,e7™)* where repeated multiple indices imply 
contraction over all the indices they represent. We define | D™K7, | 
(or simply, |D™K|) to be |K7,,.| where |L| = m and as before, | indi- 
cates covariant differentiation with respect to §. 

Let WV be an imbedded submanifold of -@ with compact closure 
in M. Then ||K7,,.%||,, is defined to be 


{5 | ‘ ((D>Kt, ytdo}), 


where do is the volume element on .V induced by e. We also define 
|K,4V le to be the same expression where the derivatives are taken 
only in directions tangent to ./. Clearly, ||K,./||,, > ||K,V |. 

The Sobolev spaces W™(r,s, V ) (or simply W™(1 )) are then defined 
to be the vector spaces of tensor fields A‘, of type (r,s) whose values 
and derivatives (in the sense of distributions) are defined almost every- 
where on WV (i.e. except, possibly, on a set of measure zero; for the 
rest of this section ‘almost everywhere’ is to be understood almost 


everywhere) and for which | A/ th is finite. With the norms 


| , ||» the Sobolev spaces are Banach spaces in which the C” tensor 
fields of type (r,s) form dense subsets. If e’ is another contmuous posi- 
tive definite metric on -@ then there will be positive constants C, and 
C, such that 

O,|K7,| < |K7,|"<C,|K7;| on YW, 


and C,| Ky, Tm < ||K4-M Tm’ < On|) Ks, 


m° 


Thus || ,.4|,,’ will be an equivalent norm. Similarly another 0” 
background metric 8’ will give an equivalent norm. In fact it follows 
from two lemmas given below that if 8"eW™(4”) and 2m is greater 
than the dimension of .4”, then the norm obtained using the covariant 
derivatives defined by 8” is again equivalent. 

We now quote three fundamental results on Sobolev spaces. The 
proofs can be derived from results given in Sobolev (1963). They 
require a mild restriction on the shape of 4”. A sufficient condition will 
be that for each point p of the boundary @.4° it should be possible to 
imbed an n-dimensional half cone in .f with vertex at p, where n is 
the dimension of .+”. In particular this condition will be satisfied if 
the boundary @/ is smooth. 
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Lemma, 7.4.1 


There is a positive constant P, (depending on”, e and ®) such that 
for any field K7,e W"(W ) with 2m > n, where nis the dimension of ” ; 


|K| < B|K,/],, on /. 


From this and the fact that the vector space of all continuous fields 
K‘; on ¥ is a Banach space with norm sup |K|, it follows that if 
4 


K!,eW™(4") where 2m > n, then K7, is continuous on WV’. Similarly 
if K7;eW™?(/), then K!, is OP on. WV. 


Lemma 7.4.2 


There is a positive constant P, (depending on ./, e and 8) such that 
for any fields K7,, LPge W™(/) with 4m > n, 


[RTL ON lo < AK, V mL )I|m- 


From this and the previous lemma it follows that ifn < 4and 2m > n, 
then for any two fields K’,, LPgeW™(W), the product K!, LP? g is 
also in W™(VW ). 


Lemma. 7.4.3 


Tf isan (n— 1)-dimensional submanifold smoothly imbedded in./’, 
there is a positive constant P, (depending on /, 4’, e and 8) such 
that for any field K7,6eW™'*'(S ), 


[KM Im < Psl|K,V | ns- 


We shall prove the existence and uniqueness of developments for 
(f,w) when h®e W***(S) and ye W+4(Y) where a is any non- 
negative integer. (If Y is non-compact, we mean by h? « W™S) that 
heeWn( ) for any open subset WV of Y with compact closure.) 
A sufficient condition for this is that h2? be C4+¢ and y” be C3+¢ on F; 
by lemma 7.4.1, a necessary condition is that h? be C?+¢ and y® be 
C1+¢, The solution obtained for g? will belong to W4+#(#) for each 
smooth spacelike surface # and so the (2+ a)th derivatives will be 
bounded, i.e. g” will be C®@+®- on &. 

These differentiability conditions can be weakened to cases such as 
shock waves where the solution departs from W* behaviour on well- 
behaved hypersurfaces; see Choquet—Bruhat (1968), Papapetrou and 
Hamoui (1967), Israel (1966), and Penrose (1972a). However no proof 
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is known for cases in which such departures occur generally. The W* 
condition for the existence and uniqueness of developments is an 
improvement on previous work (Choquet-Bruhat (1968)) but it is 
somewhat stronger than one would like since the Einstein equations 
can be defined in a distributional sense if the metric is continuous and 
its generalized derivatives are locally square integrable (i.e. if g is C° 
and W1). On the other hand any W” conditions for p less than 4 would 


Ficure 48. Y is an open set with compact closure in the manifold .# = ¥ x Rl. 
&, is the region of Y for which t > 0 and W(t’) is the region of W between t = 0 
SUING) ff = i Se 


not guarantee the uniqueness of geodesics, or, for p less than 3, their 
existence. Our own view is that these differences of differentiability 
conditions are not important since as explained in § 3.1, the model for 
space-time may as well be taken to be C2. 

In order to prove the existence and uniqueness of developments we 
now establish some fundamental inequalities (lemmas 7.4.4 and 7.4.6) 
for second order hyperbolic equations, in a manner similar to that of 
the conservation theorem in § 4.3. 

Consider a manifold ./ of the form # x Rt where # is a three- 
dimensional manifold. Let Y be an open set of “with compact closure 
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which has boundary @Y and which intersects (0), where #(t) 
denotes the surface # x {t}, te R!. Let YW, and Y(t’) denote the parts 
of % for which ¢ > 0 and t’ > ¢ > 0 respectively (figure 48). On Y, let 
& be a C?- background metric and let e be a C!~ positive definite 
metric. We shall consider tensor fields A7,; which obey second order 
hyperbolic equations of the form 


L(K) = AP Kg, + Bg Keng t+ Cg Kop = Fly, (7.20) 


where A is a Lorentz metric on WY, (i.e. a symmetric tensor field of 
signature + 2), B, Cand F are tensor fields of type indicated by their 
indices, and | denotes covariant differentiation with respect to the 
metric g. 


Lemma 7.4.4 


If (1) Wn &, is achronal with respect to A, 
(2) there exists some Q, > 0 such that on %,, 


AVt ity <—Q, 
and A®*W,W, 2 Q,e%W,W, 


for any form W which satisfies A”t,,W, = 0, 
(3) there exists some Q, such that on Y,, 


|A| <Q, “|DA| <Q, |B] <@, |C| <Q, 


then there exists some positive constant P, (depending on %, e, §, 
Q, and @,) such that for all solutions K’; of (7.20), 


|K, Hn %,|, < P{|K, #(0)n Zi + |F, Z(t)|\0}- 


One forms the ‘energy tensor’ S® for the field K7, in analogy to the 
energy-momentum tensor of a scalar field of unit mass (§ 3.2): 


Sa = {(A%Aed 14% Aca) KT, K? g—-FAPK, KF olei@erp. (7-21) 


The tensor S* obeys the dominant energy condition (§4.3) with 
respect to the metric A (ie. if W,, is timelike with respect to A then 
SW, W, > 0 and S®W, is non-spacelike with respect to A). Moreover 
by conditions (2) and (3) there will be positive constants Q, and Q, 


such that 
Q3(|K|?+ |DK|?) < S*at, < Qa([K|?+|DK]?). (7.22) 


We now apply lemma 4.3.1 to S®, taking WY, as the compact region F 


238 THE CAUCHY PROBLEM [7.4 


and using the volume element dé and covariant differentiation defined 
by the metric g: 


{ _ 8%, dd, < | _ Sa4, de, 
HOYIN, H (094s 
t 
2 i i ; (PSH, + 8%) dy} a (7.23) 
0 HUY. 


where P is a positive constant independent of S%. (The sign has been 
changed in the first term on the right-hand side since the surface 
element dG, of the surface #(t) is taken to have the same orientation 
ast, ie. dé, = t, dé where dé is a positive definite measure on #(t).) 
Since e and § are continuous there will be positive constants Q; and Q, 


such that on Y, Q:do <de < Q,da, (7.24) 


where do is the area element on #(t) induced by e. Thus by (7.22) 
and (7.23) there is some Q, such that 


IK, 200 Wh < Q,{ IK, 2600) 0° 


t t 
+| |K, #(t')n 4 lta + | (SS, bdo') au’. (7.28) 
0 0 
By (7.20), 
S®, = AWK! |, FPo e7%e;5+ (terms quadratic in K/, and 


K? 9), with coefficients involving A‘, A“, 


A 


Ro aes, Bg and CP!g;). (7.26) 
Since the coefficients are all bounded on Y,, there is some Q, such that 
SM ota < Os{|F|? +|K]? + |DK]3}. (7.27) 

Thus there is some Q, such that, from (7.25) and (7.27), 


IK, 260) Wh? < Qf IK, 2(0)n % I? 
e | [K, (yn &,|],240’ +B, moi. 
This is of the form da/dt < Q,{x+y}, (7.28) 
alin z(t) = I) IK, 20) %, Par’ 


t 
Therefore fa core e—Ost'y(t’) dé’. (7.29) 
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Since y is a monotonically increasing function of ¢ and since ¢ is 
bounded on @%,, there is some Q,, such that 


®< Qi. 
Thus ||K, #(t)n @,||, < P,{|K, 7(0)n %,|,+||F, ZO], where 


ES (Qo + Q0)?- O 
With this inequality one can immediately prove the uniqueness of 
solutions of second order hyperbolic equations which are linear, i.e. 
for which A, B, C and F do not depend on K. For suppose AY, and 
K~, were solutions of the equation Z(K) = F which had the same 
initial values and first derivatives on #(0)n WY. Then one can apply 
the above result to the equation Z(K!— K?) = 0 and obtain 


\K2—K?, #(t)n Z|], = 0. 
Therefore K! = K? on Y%,. One has thus 


Proposition 7.4.5 

Let A be a C!- Lorentz metric on .#@ and let B, C, and F be locally 
bounded. Let ¥ <M be athree-surface which is spacelike and acausal 
with respect to A. Then if V is a set in D*(#,'A), the solution on V of 
the linear equation (7.20) is uniquely determined by its values and 
the values of its first derivatives on #0) J-(V, A). 


By proposition 6.6.7, D*+(#, A) is of the form # x Rl. If geV, then 
by proposition 6.6.6, J-(q) fn J+(#) is compact and so may be taken 
for Y,. Oo 


Thus a physical field obeying a linear equation of the form (7.20) will 
satisfy the causality postulate (a) of §3.2 provided the null cone of A 
coincides with or lies within the null cone of the space-time metric g. 

In order to prove the existence of solutions of the equations (7.20) 
we shall need inequalities for higher order derivatives of K. We shall 
now take the background metric g to be at least C®+* where a is a non- 
negative integer and we shall take Y to be such that #(0)n U has 
a smooth boundary and such that there is a diffeomorphism 


A: (#(0)n %) x [0, > F%, 
which has the property that for each t€[0, ¢,], 
A(H(0)n HZ), = H(t)n U,. 


We do this so that there shall be upper bounds P,, P, and P, to the 
constants P,, P,and P,in lemmas 7.4.1—7.4.3 for the surface (by 
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Lemma 7.4.6 
If conditions (1) and (2) of lemma 7.4.4 hold and if 
(4) there is some Q, such that 
A, U Nara < Qs, |B, Us \\s+0 <@3, |G, Us \s+0 <Q; 
(by lemma 7.4.1, this implies condition (3)), then there exist positive 
constants P; , (depending on %, e, §, a, Q, and Q;) such that 


5, a 


IK, HQ) lara < Ps af[K, H(0)N Zara t ||P FO)sra}- (7.30) 


From lemma 7.4.4 one has an inequality for ||K, #(é)n %,.||,. To obtain 
an inequality for ||K, #(t)n @%.||, one forms the ‘energy’ tensor S% 
for the first derivatives K‘,,, and proceeds as before. The divergence 
S, can now be evaluated by differentiating equations (7.20): 


8%, = A4@K! | 4 FP, ee! Pe;p + (terms quadratic in K/,. 
K!,,.and K!,,., with coefficients involving A“, 
Jie Jied & 
no ce ic tae eT a, CPAs 
and CP!) 5,4). (7.31) 


With the possible exceptions of B°P!9 ;,,and CP!) ; z, these coefficients 
are all bounded on Y,, in the case a = 0. When integrated over the 
surface #(t') &,, the term in (7.31) involving BePly jig is 


-| AMEE 56 BOP Syl pg? cnet g. (7.32) 
HUN, , 
There is some Q, such that for all t’, (7.32) is less than or equal to 
Q, i |DB] [DK] |D*K| do 
HEV AN 


< 10, | (|D?K|?+ |DB|?|DK|2)do. (7.33) 
HEN Us 
By lemma 7.4.2, 


— |DB|? |DK|?do < P2||B, H(t’) U,||.?|K, H(t’) n 02, 


where, by condition (4) and lemma 7.4.3, |B, ¥(¢')/n Wii be OP Ox 
The term involving C?/, ;,, can be bounded similarly. Thus by lemma 
4.3.1 there is some constant Q, such that 


(ee. (|D°K| + |DK]|?) do < 1 (|D?K|? + |DK]?) do 


t 
+ [IK 260 0 when's { DF|Pac}. (7.34) 
Ut) 
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By lemma 7.4.4, 


[reone, RPA < IK 2600 Wh 
HOU, 
< 2PAVK, 7(0)n @2+|F, V7}. (7.35) 
Adding this to (7.34), one obtains 
IK, 20) WP < Qf IK, 2(0)0 Ula? 
t 
+ i 1K 2) [ede +|F, aot]. (7.36) 
where Q, = Q,+ 2P,. By a similar argument to that in lemma 7.4.4, 
there is some constant Q, such that 
(K, (0) n Wo < Q{|K, #(0)n Z.+||F, Z)|g- (7.37) 
From lemma 7.4.1 it now follows that on 7, 
|K| < P,Q,{|K, #(0)n %.+|F, ZO)|o}- (7.38) 


Using this one may proceed in a similar way to establish an inequality 
for |K, #(t)n W,,||5. The divergence of the ‘energy’ tensor now gives 
a term of the form 


0.{ (|D®K|? + | D?B|? |DK/2) do. (7.39) 
H(t) U, 


By lemma 7.4.2 the second term above is bounded by 
Q.P,? |B, H(t) 0 U.|.>||K, HW) 0 U2? 


where by condition (4), ||B, #(t)n Z..||, is defined for almost all 
values of ¢’ and is square integrable with respect to t’. Thus one can 
obtain an inequality for ||K, #(t)n Y,||, in the same manner as for 
|K, #(t)n W,||,. The procedure for higher order derivatives is 
similar. oO 


Corollary 
There exist constants P, , and P, , such that 
|K, (00 Ware < Fe af|K, (0) Ulara 
+ || K7 7.4%, #(0)N Usiat |F, U,|s+0}> 
and |K, Villara < PF, {ditto}, 


where wu is some C?+¢ vector field on #(0) which is nowhere tangent 


to #(0). 
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By (7.20), the second and higher derivatives of K out of the surface 
HH (0) may be expressed in terms of F and its derivatives out of #(0), 
K!,,,u* and derivatives of K in the surface #(0). By lemma 7.4.3, 


A, #(0)N Ulsra < Ps Qs, 
B, #(0)N Vlora < P3Qs, 
C, #(0)N Vlora < PQs; 
F, £°(0)N Ura < PoilF, Us lsra° 
Thus there will be some constant Q, such that 
|K, (0)N Vara < Val K, 4°(0) 0 Uara 
+ | K77)qU%, 4(0)N Wa+at F: Mt ,\orab (7.41) 


(7.40) 


The second result follows immediately, since tis bounded on @,. O 


We can now proceed to prove the existence of solutions of linear 
equations of the form (7.20). We first suppose that the components of 
A, B, C, F, u and § are analytic functions of the local coordinates 
x}, x?, 23 and x4 (xt = t) ona coordinate neighbourhood ¥ and take the 
initial data K1, = ,K7, and K!,,u* = ,K‘, to be analytic functions 
of the coordinates x1, x? and 23 on #(0)n ¥ . Then from (7.20) one can 
calculate the partial derivatives é?(A7,)/ét?, 03(A7,)/ét? é2*, B(A7,)/et8, 
etc. of the components of K out of the surface #(0) in terms of 
derivatives of ,)K and ,K in # (0). One can then express A7, as a formal 
power series in 21, x”, x3 and ¢t about the origin of coordinates p. By the 
Cauchy—Kowaleski theorem (Courant and Hilbert (1962), p. 39) this 
series will converge in some ball ¥(r) of coordinate radius r to give a 
solution of (7.20) with the given initial conditions. One now selects 
an analytic atlas from the C® atlas of .@. covers #(0)n @ with co- 
ordinate neighbourhoods of the form ¥ (r) from this atlas. and in each 
coordinate neighbourhood constructs a solution as above. One thus 
obtains a solution on a region 7(t,) for some t, > 0. One then repeats 
the process using # (t,). By the Cauchy—Kowaleski theorem, the ratio 
of successive intervals of t for which the power series converges is 
independent of the initial data and so the solution can be extended to 
the whole of WY, in a finite number of steps. This proves the existence 
of solutions of linear equations of the form (7.20) when the coefficients, 
the source term and the initial data are all analytic. We shall now 
remove the requirement of analyticity. 
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Proposition 7.4.7 
Tf conditions (1), (2) and (4) hold and if 


(eee 47); 
(6) KeW44(#7(0)n %), Ke W+4((0)n Z), 


then there exists a unique solution K « W4+*(Y,) of the linear equation 
(7.20) such that on #0), K7, = .K1, and K!,,,u* = ,K’,. 


We prove this result by approximating the coefficients and initial data 
by analytic fields and showing that the analytic solutions obtained 
converge to a field which is a solution of the given equations with the 
given initial conditions. Let A, (n = 1,2,3,...) be a sequence of 
analytic fields on Y, which converge strongly to Ain W4+4(@%,). (A, is 
said to converge strongly to Ain W if ||A, — All, converges to zero.) 
Let B,,, C,, and F,, be analytic fields on WY, which converge strongly 
to B, Gand F respectively in W3**(@%_,), and let ,>K,, and ,K,, be analytic 
fields on #(0)N YW which converge strongly to 9K and ,K in 
W4+a( /(0)n Z) and W3+2(4/(0)n Y) respectively. For each value 
of there will be an analytic solution K,, to (7.20) with the initial 
values “Ki? , = ok nip, Kal net’ = 14,17 By the corollary to lemma 
7.4.6, |K,, W\|4.9 Will be bounded as n-> co. Therefore by a theorem 
of Riesz (1955) there will be a field Ke W4*+7(@_,) and a subsequence 
K,,, of the K, such that for each b, 0<b <4+a, D°K,, converges 
weakly to D°K. (A sequence of fields [,7; on W is said to converge 
weakly to I/, if for each C® field J7,, 


i ese do i 1, J, ac.) 
NM N 


Since A, B,, and C,, converge strongly to A, B and C in W*(@,), 
sup|A—A,|, sup|B—B,| and sup|C—C,]| will converge to zero. 
Thus L,,(K,,-) will converge weakly to L(K). But L,,(K,,-) isequal to F,, 
which converges strongly to F. Therefore L(K) = F. On #(0)N U 
K,/7, and K,,7,,u* will converge weakly to K’; and K7;,,u* which 
must therefore be equal to ,K7, and ,K‘, respectively. Thus K is a 
solution of the given equation with the given initial conditions. By 
proposition 7.4.5 it is unique. Since each K,, satisfies the inequality in 
lemma 7.4.6, K will satisfy it also. Oo 
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7.5 The existence and uniqueness of developments for the 
empty space Einstein equations 


We shall now apply the results of the previous section to the Cauchy 
problem in General Relativity. We shall first deal with the Einstein 
equations for empty space (7'% = 0), and shall discuss the effect of 
matter in §7.7. 

The reduced Einstein equations 


E®, (gcd) = 87T% — (Ro — 4 RG”) (7.42) 


are guast-linear second order hyperbolic equations. That is, they have 
the form (7.20) where the coefficients A, B and C are functions of 
K and DK (actually, in this case A? = g” is a function of ¢” and 
not of d,). To prove the existence of solutions of these equations we 
proceed as follows. We take some suitable trial field ¢’” and use this 
to determine the values of the coefficients A, B and Cin the operator L. 
Using these values we then solve (7.42) as a linear equation with the 
prescribed initial data and obtain a new field 6”. We thus have a 
map a which takes ¢’ into d”, and we show that under suitable condi- 
tions this map has a fixed point (i.e. there is some ¢ such that 
a() = #). This fixed point will be the desired solution of the quasi- 
linear equation. . 

We shall take the background metric to be a solution of the empty 
space Einstein equations and choose the surfaces #(t)n YW, and 
dU U, to be spacelike in 8. Then by lemma 7.4.1 there will be some 
positive constants Q, such that if for some value of a > 0 


|S’, Vallara < (7.43) 


then the coefficients A’, B’ and C’ determined by @’ satisfy conditions 
(1), (2) and (4) of lemma 7.4.6 for given values of Q, and Q,. From 
(7.41) one then has 


|9", Villaza < Py allloh, (0) n UW ssat ab, FF(0) 0 ee 
Thus the map «: W4+4(%,)-> W4+4(Y, ) will take the closed ball Wir) 
of radius 7 (r < Q,) in W4ta(w ,) into itself provided that 
lob, (0) Basa < AYP, 
and I, (0) 0 Zasa < APP a. (7.44) 


We shall show that a has a fixed point if (7.44) holds and if r is 
sufficiently small. 
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Suppose ¢,’ and @,' are in W(r). The fields ¢,” = a(@,’) and 
2 =a(h,') satisfy E,'(p,") = 0, E,'(p,”) = 0 where E,' is the 
Einstein operator with coefficients A,’, B,’ and C,’ determined by dy’. 

Thus , ” " ’ , ” 
EExd, ~ 2") = =(Ki — E,') (¢,"). (7.45) 


Since the coefficients A,’, B,’ and C,’ depend differentiably on d,' and 
D¢,’ for d,’ in W(r), there will be some constant Q, such that on Y a 


JAY-A’sl < Q164- "|, 
|B’, -B’| < Q4(/6'1- $2| + |Do',-D¢’,]), (7.46) 
|C’";—C',| < Q4(|61—- | + |Dh’, -— D’,)). 

Therefore by lemmas 7.4.1 and 7.4.6, 


(2 — B's) (@"2)| < 87Qy Py Pra Fal. — 6's] + [Do's - De’,|). 


We now apply lemma 7.4.4 to (7.45) to obtain the result 
|P'1-$"», Ul, < 1Qs|16'1—- "2, Vall, (7.47) 


where @, is some constant independent of r. Thus for sufficiently 
small 7, the map a will be contracting in the | ||, norm (i.e. 
|x(p,) —x(po)|1< | 6; —G/|,) and the sequence «”(¢’,) will converge 
strongly in W1(Y_,) to some field @. But by the theorem of Riesz some 
subsequence of the o”($’,) will converge weakly to some field 
d € W(r). Thus ¢ must equal ¢ and so be in W(r). Therefore a(@) will 
be defined. Now 


lah) — a 41(h'1), Walla < 7Q5||6—2('), Wall 


As n->co, the right-hand side tends to zero. This implies that 
llx(@) -$, Y,||, = 0 and so that a(@) = d. Since the map « is con- 
tracting the fixed point is unique in W(r). We have therefore proved: 


Proposition 7.5.1 

If & is a solution of the empty space Einstein equations, the reduced 
empty space Einstein equations have a solution de W**4(Y,) if 
ob, 2(0)N Bara and |p, (0) Zi,,. are sufficiently small. 
lp, (0) N Z,||44q Will be bounded and so ¢ will be at least C@+9-. 0 


This solution will be locally unique even among solutions which are 
not in W4(%,,). 
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Proposition 7.5.2 

Let ¢ be a C1 solution of the reduced empty space Einstein equations 
with the same initial data on an open set V © #(0)n Y. Then p= 
on a neighbourhood of V in @,. 


Since ¢ is continuous one can find a neighbourhood @ of Vin W such 
that the conditions of lemma 7.4.4 hold for A, B and C. As before one 


oo Big -) = -(£-B)(¢). (7.48) 
Similarly there will be some Q, such that 
| (Z-E) (P), HW) 0 Uo < WiG—-F. H (0) WI. 
Applying lemma 7.4.4 to (7.48) one obtains an inequality of the form 
dz/dt < Q,2, 
ee me i Id —$, #10) W,||, at’. 


Therefore @ = ¢ on Y,. o 


Proposition 7.5.1 shows that if one makes a sufficiently small 
perturbation in the initial data of an empty space solution of the 
Einstein equations one obtains a solution in a region Y@,>What one 
wants however is to prove the existence of developments for any initial 
data h® and y® which satisfy the constraint equations on a three- 
manifold . To do this we proceed as follows. We take -@ to be RF, 
e to be the Euclidean metric and § to be the flat, Minkowski metric 
(this is a solution of the empty space Einstein equations). In the usual 
Minkowski coordinates z1, x?, x3 and x* (x? = t) we take Y to be such 
that @&%n W,, is spacelike and #(0)9 & consists of the points for 
which (x1)? + (x?)? + (23)? < 1,24 = 0. The idea now is that any metric 
appears nearly flat if looked at on a fine enough scale. Therefore if one 
maps a sufficiently small region of Y onto #(0)n WY, one can use 
proposition 7.5.1 and obtain a solution on ¥%,. We then repeat this for 
other portions of Y and join up the resulting solutions to form a 
manifold @ with metric § which is a development of (.Y, w). 

Let ¥, be a coordinate neighbourhood in Y with coordinates y}, y? 
and y° such that at p, the origin of the coordinates, the coordinate 
components of h” equal 6. Let %(f,) be the open ball of coordinate 
radius f, about p. Define an imbedding 6,: %(f;) > by a = f,-1y* 
(2 = 1, 2,3), 24 = 0. By the usual law of transformation of a basis, the 
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components of 0,” and 6,.v~ with respect to the coordinates {x} are 
fy, times the components of h® and y® with respect to the co- 
ordinates {y}. We define new fields h’# and y'® on ¥, by h’® = f,2he 
and y’® = f,3 y®. Then since h is continuous (in fact C2+*) on Y one 
can make g’%—@ and g’.u° arbitrarily small on #(0)n YW by 
taking f, sufficiently small, where g’% and g’@°,.u° are defined from 
h’ and y’® in the manner of §7.3. The derivatives of g'@ and g’®,,u° 
in the surface .#(0) will also become smaller as f, is made smaller. 
Thus ||’, #(0)n Zi4., and |’, #(0)n Z5,, can be made small 
enough that proposition 7.5.1 can be apphed and a solution for ¢’ 
obtained on W.. Then gf? = f,-?g'@ will be a solution of the reduced 
Einstein equations with the initial data determined by h® and y@. 
Similarly one can obtain a solution on W_, the part of Y on whicht < 0. 

One can now cover ¥ by coordinate neighbourhoods ¥,(f,) of the 
form ¥;(f,), map them by imbeddings 6, to neighbourhoods %, of the 
form Y and obtain solutions 9,7 on Y%,. The problem now is to 
identify suitable points in the overlaps to make the collection of the 
UW. into a manifold with a metric §. To do this we make use of the 
harmonic gauge condition 


pe. = g”\.— $9°°G30JX 'c = Q. (7.49) 


By the definition (7.3) of 6I%,,, this is equivalent to g# dT), = 0. 
Therefore for any function 7, 


CZ = Zang? — OT ap zieg” — Bea ae (7.50) 


If the background metric is the Minkowski metric and z is one of the 
Minkowski coordinates 2}, x2, x3 and 24, the right-hand side of (7.50) 
will vanish. Suppose now one has an arbitrary W*** Lorentz metric g 
on a manifold .@. In some neighbourhood Y < -@ one can find four 
solutions z!, 27, z3 and 24 of the linear equation 


2.ap9@ = 0 (7.51) 


which are such that their gradients are linearly independent at each 
point of % We may then define a diffeomorphism p: Y>M by 
am 2% (a = 1,2,3, 4). This diffeomorphism will have the property 
that the metric ,g% on HM will satisfy the harmonic gauge condition 
with respect to the Minkowski metric § on dd. Thus if the metric g is 
a solution of the Einstein equations on 4%, the metric #,8 will be 
a solution of the reduced Einstein equations on Awith the background 
metric g. 
9-2 
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The procedure to identify points in the overlap between two neigh- 
bourhoods %, and %, is therefore to solve (7.51) on U, for the coordi- 
nates x,1, x4", «4° and a,* using the initial values for x, and a 4%, 
determined by the overlap of the coordinate neighbourhoods ¥, and 
V, on FS. In fact x,',.u%7=0 (1 = 1, 2, 3) and x,4,,u7 = 1 where 
ut = G/éx,%is the unit vector in Y, orthogonal to #(0) in the metric 8. 
Thus,‘ = x,‘ though z,’ will not in general be equal to x,. By proposi- 
tion 7.4.7. the coordinates x,* will be C@+®- functions on ae (in 
proposition 7.4.7 the background metric with respect to which the 
covariant derivatives are taken has to be C6+@-. Thus it cannot be 
applied directly to (7.51), since the covariant derivatives are taken 
with respect to $, which is only W4+*, However one can introduce 
a C5+¢ background metric & and express (7.51) in the form 


ZiobI + Zab = 9, 


where || indicates covariant differentiation with respect to g. Proposi- 
tion 7.4.7 can then be applied to this equation.) 

Since the gradients of x,“ are linearly independent on #(0)n %,, 
they will be linearly independent on some neighbourhood @%’, of 
H(0) in Y,. The metric n,g% will be at least C!- on u(Y",) in 
U ,. Since it will obey the reduced empty space Einstein equations 
on %, in the background metric g and since it has the samesinitial data 
on 0,(¥,n ¥,), it must coincide with g, on some neighbourhood @%,’ of 
6(¥,0 ¥,) in V,. This shows that one may join together WY”, and U’, 
to obtain a development of the region ¥,U¥, of Y. Taking the 
covering {¥,} of Y to be locally finite, one may proceed in a similar 
fashion to join together the subsets of the other neighbourhoods {Y,} 
to obtain a development of Y, i.e. a manifold W with a metric § and 
an imbedding 6: Y >. such that § satisfies the empty space Einstein 
equations and agrees with the prescribed initial data w on 6(/), which 
isa Cauchy surface for @. If (-#’, 8’) is another development of (Y,w) 
one can by a similar procedure establish a diffeomorphism ys between 
some neighbourhood of 6'(’) in. #’ and some neighbourhood of 6(.7) 
in @ such that w,g’ = g%. We have therefore proved: 


The local Cauchy development theorem 

If h« W4*4(f) and xy” « W3+4(S) satisfy the empty space constraint 
equations there exist developments (#,§) for the empty space 
Einstein equations such that g¢ W4+4(.@) and ge W4+4(#) for any 
smooth spacelike surface #. These developments are locally unique 
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in that if (.#’, 8’) is another W4+¢ development of (Y, w) then (4, 8) 


and (.#’,§') are both extensions of some common development of 
(SF, od). 


That ge W*2(4) follows from lemma 7.4.6 since the surfaces of 
constant ¢ can be chosen arbitrarily. O 


7.6 The maximal development and stability 


We have shown that if the initial data satisfied the empty space 
constraint equations one can find a development, i.e. one can construct 
a solution some distance into the future and past of the initial surface. 
In general, this development can be extended further into the future 
and past to give a larger development of (%, w). However we shall 
show by an argument similar to that of Choquet-Bruhat and Geroch 
(1969) that there is a unique (up to a diffeomorphism) development 
(4,8) of (F,w) which is an extension of any other development 
of (SF, w). 

Recall that (.@,. 8,) is an extension of (-4,, 8.) if there is an imbed- 
ding uw: #,—>-M, such that ~,8 = &,, and such that 6,-1w@, is the 
identity map on ¥. Given a point ge, and a distance s one can 
uniquely determine points p,€-, and p,¢.@, by going a distance s 
along the geodesics orthogonal to 6,(.Y) and 6,(/) through 6,(q) and 
4,(q) respectively. Since “(p,) must equal p,, the imbedding 4 must be 
unique. One can therefore partially order the set of all developments 
of (F,w), writing (MG, 8.) < (4, 81) if (4G, 8) is an extension of 
(M,, 85). If now {(-4,, &,,)} is a totally ordered set (a set .o/ is said to be 
totally ordered if for every pair a, b of distinct elements of ., either 
a <borb < a) of developments of (/, w), one can form the manifold 
M' as the union of all the 4, where for (4,,8,) < (4, ,) each 
p,€-M, is identified with ,,(p,)€4,, where py,: M,> M, is the 
imbedding. The manifold -@’ will have an induced metric g' equal to 
Hae Bo On each f4,(4,) where u,: 4, M’ is the natural imbedding. 
Clearly (.@’, &’) will also be a development of (¥, w); therefore every 
totally ordered set has an upper bound, and so by Zorn’s lemma (see, 
for example, Kelley (1965), p. 33) there is a maximal development 
(&@,%) of (f, w) whose only extension is itself. 

We shall now show that (.@, &) is an extension of every development 
of (FY, w). Suppose (W’, §’) is another development of (7, w). By the 
local Cauchy theorem, there exist developments of (7, w) of which 
(.&@, ®) and (.M’, 8’) are both extensions. The set of all such common 
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developments is likewise partially ordered and so again by Zorn’s 
lemma there will be a maximal development (.@", 8”) with the imbed- 
dings i: W"> WM and pw’: WM’ >M, etc. Let 4 be the union of 
M, M' avd M", where each p"€.M’ is identified with fi(p")¢€.@ and 
b'(p") eM’. If one can show that the manifold .@* is Hausdorff, the 
pair (4+, §*) will be a development of (.Y, w). It will be an extension 
of both (a , 8) and (.4@’, 8’). However the only extension of (M , 2) Is 
(.M@, &) itself, and so (@, &) must equal (.W+, $+) and be an extension 
of (.4’, 8’). 

Suppose that .@+ were not Hausdorff. Then there exist points 
Pe(fi(4")) < Mand p'e(u'(M")) <M’ such that every neighbour- 
hood &@ of — has the property that w’(#—1(W)) contains p’. Now since 
(-4@", 8") is a development, it will be globally hyperbolic as will its 
image fi(.@") in @. Therefore the boundary of f(A") in MM rust be 
achronal. Let y be a timelike curve in .@ with future endpoint at #. 
Then p’ must be a limit point in -#’ of the curve n’A—(y). In fact it 
must be a future endpoint, since strong causality holds in (.@’, ¢’). 
Thus the point p’ is unique, given /. Further, by continuity vectors 
at p’ can be uniquely associated with vectors at ». Thus one can find 
normal coordinate neighbourhoods Y of pin. # and W of p in-M’ such 
that under the map yf“! points of Yn fi(.W”) are mapped into points 
of &' 9 p'(#") with the same coordinate values. This shdws that the 
set ¥ of all ‘non-Hausdorff’ points of (a(-@"))’ is open in (f(.W"))’. 
We shall suppose that¥ is non-empty, and so obtain a contradiction. 

If A is a past-directed null geodesic in.# through je.F, then since 
one can associate directions at p with directionsat p’, one can construct 
a past-directed null geodesic 4’ through p’ in .#’ in the corresponding 
direction. To each point of An (f(.#”))’ there will correspond a point 
of A’n (u'(-#"))' and so every point of An (u(.@"))’ will be inF¥. Since 
O(S) isa Cauchy surface for./, X must leave (4(4")) at some point §. 
There will be some point FE F ina neighbourhood of § such that there 
is a spacelike surface # through 7 which has the property that 
(# —#) < ji( MW"). There will be a corresponding spacelike surface 


we = (M(H — 7))Ur’ in -@’ through the corresponding point r’. 
The surfaces # and #' may be regarded as images of a three- 
dimensional manifold # under imbeddings yr: He». anda 
Y': # > M such that y-Yip’—1yy’ is the identity map on # —yr-1(p). 
The induced metrics y,(%) and 's(8') on # will agree since = 
and #” —p’ are isometric. By the local Cauchy theorem, they will be 
in W4*«(.#). Similarly the second fundamental forms will agree and 
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be in W*+4(4). Neighbourhoods of # in-# and #’ in.W’' would be 
W4* developments of #. By the local Cauchy theorem they must be 
extensions of the same common development (.@*,$*). J oining 
(*, o*) to (-@", ") one would obtain a larger development of (7, w), 
of which (.@, &) and (.@’, 8’) would be extensions. This is impossible, 
since (#", §") was the largest such common development. This shows 
that .@+ must be Hausdorff, and so that (4, &) must be an extension 
of (.4’, 8’). 
We have therefore proved: 


The global Cauchy development theorem 

TfhM ec W4+4(F) and yx? € W3+4(S) satisfy the empty space constraint 
equations, there exists a maximal development (.W, 8) of the empty 
space Einstein equations with g ¢ W4+4(.@) and ge W4+*(#) for any 
smooth spacelike surface #. This development is an extension of any 
other such development. 


We have so far only proved that this development is maximal among 
W*+2 developments. If a is greater than zero, there will also be 
Wira-l jyate-2 .., W* developments which are extensions of the 
W+~¢ development. However, Choquet-Bruhat (1971) has pointed out 
that these developments must all coincide with the W4 development. 
This is because one can differentiate the reduced Hinstein equations 
and then regard them 4s linear equations on the W4 development, for 
the first derivatives of g. Then using proposition 7.4.7 one can show 
that g” is W® on the W4 development. if the initial data is W5. By 
continuing in this way, one can show that if the initial data is C~, there 
will be a C® development which will in fact coincide with the W4 
development. 

We have proved the existence and uniqueness of maximal develop- 
ments only for W? or higher metrics. In fact, it is possible to prove the 
existence of developments for W® initial data, but we have not been 
able to prove the uniqueness in this case. It may be possible to extend 
the W4 maximal development either so that the metric does not remain 
in W4, or so that 6(SY) does not remain a Cauchy surface. In the latter 
case, a Cauchy horizon occurs; examples of this were given in 
chapter 6. On the other hand it may be that some sort of singularity 
occurs, in which case the development cannot be extended with a 
metric which is sufficiently differentiable to be interpreted physically. 
In fact, theorem 4 of the next chapter will show that if Y 1s compact 
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and y@h,, is negative everywhere on /, then the development cannot 
be extended to be geodesically complete with a C?~ metric, i.e. with 
locally bounded curvature. 

We have shown there is a map from the space of pairs of tensors 
(he, y2*) on Y which satisfy the constraint equations to the space of 
equivalence classes of metrics § on a manifold .@, which, by proposi- 
tion 6.6.8, is diffeomorphic to “x R. If two pairs (h®, y%) and 
(h’2>, y'2) are equivalent under a diffeomorphism A: S> (i.e. 
Ay A® = hh’ and A, x = x'%) they will produce equivalent metrics §. 
We thus have a map from equivalence classes of pairs (9, vy") to 
equivalence classes of metrics 8. Now h” and y® together have twelve 
independent components. The constraint equations impose four rela- 
tions between these, and the equivalence under diffeomorphisms may 
be regarded as removing a further three arbitrary functions, leaving 
five independent functions. One of these functions may be regarded as 
specifying the position of 6(/) within the development (-4, §). There- 
fore maximal developments of the empty space Einstein equations 
are specified by four functions of three variables. 

One would like to show that the map from equivalence classes of 
(h2, v2) to equivalence classes of § is continuous in some sense. The 
appropriate topology on the equivalence classes for this is the 
W' compact-open topology (cf. §6.4). Let & be a C” Lorentz metric on @ 
and @ be an open set with compact closure. Let V be an open set in 
W"(d) and let O(Y, V) be the set of all Lorentz metrics on. W whose 
restrictions to W lie in V. The open sets of the W" compact open 
topology on the space ¥(.#) of all W" Lorentz metrics on .W are 
defined to be the unions and finite intersections of sets of the form 
O(U, V). The topology of the space &*(.#) of equivalence classes of 
W’ metrics on -# is then that induced by the projection 


m: L(M) > L*(M) 


which assigns a metric to its equivalence class (i.e. the open sets of 
,*(M) are of the form 7(Q) where Q is open in £,(M)). Similarly the 
W’ compact open topology on the space Q,(%) of all pairs (h@, sya) 
which satisfy the constraint equations is detined by sets of the form 
O(W’, V, V’) consisting of the pairs for which h® € V and vy € V’ where 
V and V’ are open sets in W*(Y) and Wr.) respectively. The C© 
metrics on .@ form a subspace ¥,(.#) of the space Y (@) of all 
Lorentz metrics on .#. Since a C® metric is I" for any r, one has the 
W’ topology on &,(M). One can then define the C® or W2 topology 
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on £,,(-@) as that given by all the open sets in the W’ topologies on 
~£.(4) for every r. The C” topology on ¥,,*(@) and on Q,(S) are 
defined similarly. 

One would like to show that the map A, from the space Q,*(S) of 
equivalence classes of pairs (h®, y%) to the space Y*(#) of equiva- 
lence classes of metrics is continuous with the W” compact open 
topology on both spaces. In other words, suppose one has initial data 
he WS) and y? « W"-1(F) which gives rise to a solution $c W(.4) 
on .#. Thenif ¥ isa region of W with compact closure, and é > 0, one 
would like to show there was some region Y of Y with compact closure 
and some é > 0 such that ||g’ -§, ||, < ¢ for all initial data (h’2®, y'%”) 
such that ||h’—h, #@||, < 46 and lx’ -x Wp < 10. This result may 
be true, but we have been unable to prove it. What we can prove is 
that this result holds if the metric is C?+)-. This follows immediately 
from proposition 7.5.1, taking g to be the background metric and Y 
to be some suitable neighbourhood of J-(V)n J+(6(SF)). In fact if one 
examines lemma 7.4.6, one sees that the condition on the background 
metric can be weakened from C’+)- to W’+», but not to W”, since the 
(r— 1)th derivatives of the Riemann tensor of the background metric 
appear. (By the background metric being W**! we mean that it is 
W*+4 with respect to a further C’+! background metric.) Thus the map 
A,: Q,*(F) > Z*(#) from the equivalence classes of initial data to 
the equivalence classes ofmetrics will be continuous in the W* compact 
open topology at every W**! metric. Although the W**? metrics form 
a dense set in the W” metrics, there is a possibility that the map might 
not be continuous at a W* metric which was not also a W**! metric. 
However 00+ 1 = 00 and so the map A,,: 0*,,(7) > L*,,(4) will be 
continuous in the C” topology on both spaces. 

One can express this result as: 


The Cauchy stability theorem 

Let (.@,g) be the W*** (0 < a < ©) maximal development of initial 
data he W*+4(.S) and xe W4+4(F), and let V be a region of J+(A(7)) 
with compact closure. Let Z be a neighbourhood of g in-#;,,(¥) and 
U be an open neighbourhood in 6(.Y) of J— (V)n OF) with compact 
closure. Then there is some neighbourhood Y of (h, x) in Q;,,(@) such 
that for all initial data (h’, yx’) ¢ Y satisfying the constraint equations, 
there is a diffeomorphism yw: 4’ —>-4 with the properties 

(1) 6-440’ is the identity on 0-1(Z), 
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(2) x8’ EZ, 
where (.4@’, 8’) is the maximal development of (h’, x’). O 


Roughly speaking what this theorem says is that if the perturbation 
of initial data on the Cauchy surface 6(Y) is small on J-(V’)n OF), 
then one gets a new solution which is near the old solution in ¥”. In 
fact the perturbation of the initial data has to be small on a slightly 
larger region of the Cauchy surface than J -V)n OF), since the null 
cones will be slightly different in the new solution and so ¥ may not 


lie in the Cauchy development of J-(V) n (SF). 


7.7. The Einstein equations with matter 


For simplicity we have so far considered the Einstein equations only 
for empty space. However similar results hold when matter is present 
providing that the equations governing the matter fields V(,’, obey 
certain physically reasonable conditions. The idea is to solve the 
matter equations with the prescribed initial conditions in a given 
space-time metric §’. One then solves the reduced Einstein equations 
(7.42) as linear equations with the coefficients determined by §’ and 
with the source term 7”? determined by g’ and by the solution for the 
matter fields. One thus obtains a new metric §” and repeats the 
procedure with §” in place of g’. To show that this converges to a 
solution of the combined Einstein and matter equations one has to 
impose certain conditions on the matter equations. We shall require: 

(a) if (K°Yiype W4+4(#) and {,¥(y} e W3+4(#) are the initial data on 
an achronal spacelike surface # in a W4*@ metric g, there exists a 
unique solution of the matter equations in a neighbourhood of # in 
D*t(#) with (Wy}e W4+4(#’) for any smooth spacelike surface #’, 
ane Ko = oo VYolneu*=1¥es on #; 

(6) if {Wp} is a W>* solution in the W5+@ metric g on the set Y es 
then there exist positive constants Q, and Q, such that 


~ [P'o- Pw, Us \lata S Qo | S =o; Vale 
ot » oP oF (ps H (0) nN Ae af ~ rege _ he te H (0) f) Wllaxa} 
for any W**¢ solution {W’)} in the metric g’ such that 


|e’ —g, Waa = Q, 
and ‘ 


% {llo® wo — oF» FH(0)N UMlara t+ [Po —1Pw, (0) Ua} <0; 
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(c) the energy-momentum tensor 7, is polynomial in 
Gee Gey. and “9%? 


Condition (a) is the local Cauchy theorem for the matter field in 
a given space-time metric. Condition (6) is the Cauchy stability 
theorem for the matter field under a variation of the initial conditions 
and under a variation of the space-time metric §. If the matter 
equations are quasi-linear second order hyperbolic equations, these 
conditions may be established in a similar manner to that for the 
reduced Einstein equations, providing that the null cones of the 
matter equations coincide with or lie within the null cone of the space— 
time metric g. In the case of the scalar field or the electromagnetic 
potential which obey linear equations, these conditions follow from 
proposition 7.4.7. One can also deal with a scalar field coupled to the 
electromagnetic potential; one fixes the metric and the electro- 
magnetic potential, solves the scalar field as a linear equation in that 
metric and potential, and then solves the electromagnetic field in the 
given metric with the scalar field as the source. Iterating this procedure 
one can show that one converges on a set of the form YW, to a solution 
of the coupled scalar and electromagnetic equations in the given 
metric, providing that the initial data are sufficiently small. One then 
shows, by rescaling the metric and the fields, that for Y,, sufficiently 
small (as measured by the space-time metric §) one can obtainasolution 
for any suitable initial data. The same procedure will work for any finite 
number of coupled quasi-linear second order hyperbolic equations, 
where the coupling does not involve derivatives higher than the first. 

The equations of a perfect fluid are not second order hyperbolic, but 
form a quasi-linear first order system. (For the definition of a first order 
hyperbolic system, see Courant and Hilbert (1962), p. 577.) Similar 
results can be obtained for such systems providing that the ray cone 
coincides with or lies within the ‘null cone of the space-time with 
metric g. The requirement that the matter equations should be second 
order hyperbolic equations or first order hyperbolic systems with their 
cones coinciding with or lying within that of the space-time metric §, 
may be thought of as a more rigorous form of the local causality 
postulate of chapter 3. 

With the conditions (a), (6) and (c) one can establish propositions 
7.5.1 and 7.5.2 for the combined reduced Einstein’s equations and the 
matter equations; from these, the local and global Cauchy develop- 
ment theorems and the Cauchy stability theorem follow. 


8 


Space-time singularities 


In this chapter, we use the results of chapters 4 and 6 to establish some 
basic results about space-time singularities. The astrophysical and 
cosmological implications of these results are considered in the next 
chapters. 

In §8.1, we discuss the problem of defining singularities in space— 
time. We adopt b-incompleteness, a generalization of the idea of 
geodesic incompleteness, as an indication that singular points have 
been cut out of space-time, and characterize two possible ways in 
which b-incompleteness can be associated with some form of curvature 
singularity. In §8.2, four theorems are given which prove the existence 
of incompleteness under a wide variety of situations. In §8.3 we give 
Schmidt’s construction of the b-boundary which represents the 
singular points of space-time. In §8.4 we prove that the singularities 
predicted by at least one of the the theorems cannot be Just a dis- 
continuity in the curvature tensor. We also show that there is not only 
one incomplete geodesic, but a three-parameter family of them. In 
§8.5 we discuss the situation in which the incomplete curves are totally 
or partially imprisoned in a compact region of space-time. This is 
shown to be related to non-Hausdorff behaviour of the b-boundary. 
We show that in a generic space-time, an observer travelling on one of 
these incomplete curves would experience infinite curvature forces. 
We also show that the kind of behaviour which occurs in Taub-NUT 
space cannot happen if there is some matter present. 


8-1 The definition of singularities 


By analogy with electrodynamics one might think it reasonable to 

define a space-time singularity as a point where the metric tensor was 

undefined or was not suitably differentiable. However the trouble with 

this is that one could simply cut out such points and say that the 

remaining manifold represented the whole of space-time, which would 

then be non-singular according to this definition. Indeed, it would seem 
[ 256 ] 
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inappropriate to regard such singular points as being part of space— 
time, for the normal equations of physics would not hold at them and 
it would be impossible to make any measurements. We therefore 
defined space-time in $3.1 as a pair (W,§) where the metric g is 
Lorentzian and suitably differentiable and we ensured that no regular 
points were omitted from the manifold - along with the singular 
points by requiring that (.W,§) could not be extended with the 
required differentiability. 

The problem of defining whether space-time has a singularity now 
becomes one of determining whether any singular points have been 
cut out. One would hope to recognize this by the fact that space-time 
was incomplete in some sense. 

In the case of a manifold W with a positive definite metric g, one 
can define a distance function p(x, y) which is the greatest lower bound 
of the length of curves from x to y. The distance function p(x, y) is 
a metric in the topological sense; that is, a basis for the open sets of W 
is provided by the sets A(x, r) consisting of all points y¢-H such that 
p(x,y) < r. The pair (-4, 8) is said to be metrically complete (m-complete) 
if every Cauchy sequence with respect to the distance function p 
converges to a point in .@. (A Cauchy sequence is an infinite sequence 
of points x, such that for any € > 0 there is a number N such that 
P(@,,,X,,) < € whenever n and m are greater than N.) An alternative 
formulation is that (4,8) is m-complete if every C1 curve of finite 
length has an endpoint in the sense of §6.2 (note that the curve need 
not be C1 at the endpoint). It therefore follows that m-completeness 
implies geodesic completeness (g-completeness), that is every geodesic 
can be extended to arbitrary values of its affine parameter. In fact it 
can be shown (see Kobayashi and Nomizu (1963)) that g-completeness 
and m-completeness are equivalent for a positive definite metric. 

A Lorentz metric, on the other hand, does not define a topological 
metric and so one is left only with g-completeness. One can distinguish 
three kinds of g-incompleteness: that of timelike, null and spacelike 
geodesics. If one cuts a regular point out of space-time, the resulting 
manifold is incomplete in all three ways and so one might hope that 
a space-time which was complete in one of the above senses would also 
be complete in the other two. Unfortunately this is not necessarily so 
(Kundt (1963)), as is shown by the following example given by Geroch 
(1968b). Consider two-dimensional Minkowski space with coordinates 
x and ¢ and metric g,,. Define a new metric 9,, = Q?g,, where the 
positive function Q has the properties: 
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) Q = 1 outside the region between the vertical lines x = —1 and 
=+1; 
ce ) Qis symmetric about the t-axis, that is, Q(t, 7) = Q(t, — 2); 

(3) on the ¢-axis, #?}0Q—> 0 as too. 

By (2) the t-axis is a timelike geodesic which by (3) is incomplete as 
t->0o. However every null and spacelike geodesic must leave and not 
re-enter the region between x = — 1 and x = +1. Therefore by (1) the 
space is null and spacelike complete. In fact one can construct 
examples which are incomplete in any of the three possible ways and 
complete in the remaining two. 

Timelike geodesic incompleteness has an immediate physical signifi- 
cance in that it presents the possibility that there could be freely 
moving observers or particles whose histories did not exist after (or 
before) a finite interval of proper time. This would appear to be an 
even more objectionable feature than infinite curvature and so it 
seems appropriate to regard such a space as singular. Although the 
affine parameter on a null geodesic does not have quite the same 
physical significance as proper time does on timelike geodesics, one 
should probably also regard a null geodesically incomplete space-time 
as singular both because null geodesics are the histories of zero rest- 
mass particles and because there are some examples (such as the 
Reissner-Nordstrém solution, §5.5) which one would tRink of as 
singular but which are timelike but not null geodesically complete. 
As nothing moves on spacelike curves, the significance of spacelike 
geodesic incompleteness is not so clear. We shall therefore adopt the 
view that temelike and null geodesic completeness are minimum condi- 
tions for space-time to be considered singularity-free. Therefore if a 
space-time is timelike or null geodesically incomplete, we shall say 
that it has a singularity. 

The advantage of taking timelike and/or null incompleteness as 
being indicative of the presence of a singularity is that on this basis 
one can establish a number of theorems about their occurrence. How- 
ever, the class of timelike and/or null incomplete space—times does not 
include all those one might wish to consider as singular in some sense. 
For example Geroch (19686) has constructed a space-time which is 
geodesically complete but which contains an inextendible timelike 
curve of bounded acceleration and finite length. An observer with 
a suitable rocketship and a finite amount of fuel could traverse this 
curve. After a finite interval of time he would no longer be represented 
by a point of the space-time manifold. If one is going to say that there 
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is a singularity in a space-time in which a freely falling observer comes 
to an untimely end, one should presumably do the same for an 
observer in a rocketship. What one needs is some generalization of the 
concept of an affine parameter to all C! curves, geodesic or non- 
geodesic. One could then define a notion of completeness by requiring 
that every C! curve of finite length as measured by such a parameter 
had an endpoint. The idea we are going to use seems to have been first 
suggested by Ehresman (1957), and has been reformulated in an 
elegant manner by Schmidt (1971). 

Let A(t) be a C! curve through pe.@ and let (Egat 1,2,3;4) be 
a basis for T,. One can parallelly propagate {E,} along A(t) to obtain 
a basis for Ty for each value of ¢. Then the tangent vector 
V = (c/ét),, can be expressed in terms of the basis as V = V‘(é) E,, and 
one can define a generalized affine parameter u on A by 


u -{ (> VtV*) dé. 
wg 

The parameter u depends on the point p and the basis {E;} at p. If {E,,} 

is another basis at p, then there is some non-singular matrix A,/ such 


that E, = a5 AE, 
7’ 


As {E,} and {E,} are parallelly transported along A(t), this relation is 
maintained with constant A,’’. Thus 


Vit) = DA; Vi(t). 
j 
Since A,’ is a non-singular matrix, there is some constant C > 0 such 
et CUVVi < SVEVE < CAS VV. 
i v i 


Thus the length of a curve A is finite in the parameter wu if and only if 
it is finite in the parameter wu’. If Ais a geodesic curve then w is an affine 
parameter on A, but the beauty of the definition is that « can be defined 
on any C! curve. We shall say that (4,8) is b-complete (short for 
bundle complete, see §8.3) if there is an endpoint for every C1 curve 
of finite length as measured by a generalized affine parameter. If the 
length is finite in one such parameter it will be finite in all such 
parameters, so one loses nothing by restricting the bases to be ortho- 
normal bases. If the metric g is positive definite, the generalized affine 
parameter defined by an orthonormal basis is arc-length and so 
b-completeness coincides with m-completeness. However b-complete- 
ness can be defined even if the metric is not positive definite; in fact it 
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can be defined providing there is a connection on -@. Clearly 
b-completeness implies g-completeness, but the example quoted 
shows that the converse is not true. 

We shall therefore define a space-time to be singularity-free if it is 
b-complete. This definition conforms with the requirement made 
above, that timelike and null geodesic completeness are minimum 
conditions for a space-time to be considered singularity-free. One 
might possibly wish to weaken this condition slightly, to say that 
space-time is singularity-free it it is only non-spacelike b-complete, 
i.e. if there is an endpoint for all non-spacelike C1 curves with finite 
length as measured by a generalized affine parameter. However this 
definition would appear rather awkward in the bundle formulation of 
b-completeness which we shall give in § 8.3. In fact each of the theorems 
we give in §8.2 implies that (.4, g) is timelike or null g-incomplete and 
hence has a singularity by both the above definitions. 

One feels intuitively that a singularity ought to involve the curva- 
ture becoming unboundedly large near a singular point. However 
since we have excluded singular points from our definition of space— 
time, difficulty arises in defining both ‘near’ and ‘ unboundedly large’. 
One can say that points on a b-incomplete curve are near the singu- 
larity if they correspond to values of a generalized affine parameter 
which is near the upper bound of that parameter. ‘UiMboundedly 
large’ is more difficult, since the size of components of the curvature 
tensor depend on the basis in which it is measured. One possibility is 
to look at scalar polynomials in 9,4, apeq, ANd Ray.q. We shall say that 
a b-incomplete curve corresponds to a scalar polynomial curvature 
singularity (s.p. curvature singularity) if any of these scalar poly- 
nomials is unbounded on the incomplete curve. However, with a 
Lorentz metric these polynomials do not fully characterize the 
Riemann tensor since, as Penrose has pointed out, in plane-wave 
solutions the scalar polynomials are all zero but the Riemann tensor 
does not vanish. (This is similar to the fact that a non-zero vector may 
have zero length.) Thus the curvature might become very large in 
some sense even though the scalar polynomials remained small. 
Alternatively one might measure the components of the curvature 
tensor in a basis that was parallelly propagated along a curve. We shall 
say that a b-incomplete curve corresponds to a curvature singularity 
with respect to a parallelly propagated basis (a p.p. curvature singu- 
larity) if any of these components is unbounded on the curve. Clearly 
an s.p. curvature singularity implies a p.p. curvature singularity. 
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One might expect that in any physically realistic solution, a 
b-incomplete curve would correspond both to an s.p. and a p.p. 
curvature singularity. However an example of a solution where this 
does not seem to be true is provided by Taub-NUT space (§5.8). Here 
the incomplete geodesics are totally imprisoned in a compact neigh- 
bourhood of the horizon. As the metric is perfectly regular on this 
compact neighbourhood, the scalar polynomials in the curvature 
remain finite. Because of the special nature of this solution, the com- 
ponents of the curvature in a parallelly propagated basis along the 
imprisoned geodesics remains bounded. Since the imprisoned geo- 
desics are contained in a compact set, one could not extend the 
manifold .# to a larger four-dimensional Hausdorff paracompact 
manifold .@’, in which the incomplete geodesics could be continued. 
Thus there is no possibility of the incompleteness having arisen from 
the cutting out of singular points. Nevertheless it would be unpleasant 
to be moving on one of the incomplete timelike geodesics for although 
one’s world-line never comes to an end and would continue to wind 
round and round inside the compact set, one would never get beyond 
a certain time in one’s life. It would, therefore, seem reasonable to say 
that such a space-time was singular even though there is no p.p. ors.p. 
curvature singularity. By lemma 6.4.8, such totally imprisoned in- 
completeness can only occur if strong causality is violated. In §8.5 we 
shall show that in a generic space-time, a partially or totally im- 
prisoned b-incompléte curve will correspond to a p.p. curvature 
singularity. We shall also show that the Taub-NUT kind of totally 
imprisoned incompleteness cannot occur if there is some matter 
present. 


8.2 Singularity theorems 


In §5.4 it was shown that there would be singularities in spatially 
homogeneous solutions under certain reasonable conditions. Similar 
theorems can be obtained for a number of other types of exact sym- 
metry. Such results, although suggestive, do not necessarily have any 
physical significance because they depend on the symmetry being 
exact and clearly in any physical situation this will not be the case. It 
was therefore suggested by a number of authors that singularities 
were simply the result of symmetries and that they would not occur in 
general solutions. This view was supported by Lifshitz, Khalatnikov 
and co-workers who showed that certain classes of solutions with space- 
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like singularities did not have the full number of arbitrary functions 
expected in a general solution of the field equations (see Lifshitz and 
Khalatnikov (1963) for an account of this work). This presumably 
indicates that the Cauchy data which gave rise to such singularities 
is of measure zero in the set of all possible Cauchy data and so should 
not occur in the real universe. However more recently Belinskii, 
Khalatnikov and Lifshitz (1970) have found other classes of solutions 
which seem to have the full number of arbitrary functions and to 
contain singularities. They have therefore withdrawn the claim that 
singularities do not occur in general solutions. Their methods are 
interesting for the light they shed on the possible structure of singu- 
larities but it is not clear whether the power series which are used will 
converge. Neither does one obtain general conditions which imply that 
a singularity is inevitable. Nevertheless we may take their results as 
supporting our view that the singularities implied by the theorems of 
this section involve infinite curvature in general. 

The first theorem about singularities which did not involve any 
assumption of symmetry was given by Penrose (1965c). It was 
designed to prove the occurrence of a singularity in a star which 
collapsed inside its Schwarzschild radius. If the collapse were exactly 
spherical, the solution could be integrated explicitly and a singularity 
would always occur. However it is not obvious that this would be the 
case if there were irregularities or a small amount of angular 
momentum. Indeed in Newtonian theory the smallest amount of 
angular momentum could prevent the occurrence of infinite density 
and cause the star to re-expand. However Penrose showed that the 
situation was very different in General Relativity: once the star had 
passed inside the Schwarzschild surface (the surface r = 2m) it could 
not come out again. In fact the Schwarzschild surface is defined only 
for an exactly spherically symmetric solution but the more general 
criterion used by Penrose is equivalent for such a solution and is 
applicable also to solutions without exact symmetry. It is that there 
should exist a closed trapped surface 7. By this is meant a C? closed 
(i.e. compact, without boundary) spacelike two-surface (normally, S2) 
such that the two families of null geodesics orthogonal to 7 are con- 
verging at 7 (i.e. 2,9 and 2%,,9” are negative, where ,%,,, and .%,, 
are the two null second fundamental forms of ZY. In the following 
chapters we shall discuss the circumstances under which such a surface 
would arise.) One may think of 7 as being in such a strong gravita- 
tional field that even the ‘outgoing’ light rays are dragged back and 
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are, in fact, converging. Since nothing can travel faster than light, the 
matter within 7 is trapped inside a succession of two-surfaces of 
smaller and smaller area and so it seems that something must go 
wrong. That this is so is shown rigorously by Penrose’s theorem: 


Theorem 1 

Space-time (4, 8) cannot be null geodesically complete if: 
(1) &,,K*K° > 0 for all null vectors K@ (cf. §4.3); 
(2) there is a non-compact Cauchy surface ¥ in 4; 
(3) there is a closed trapped surface 7 in M. 


Note: the method of proof is to show that the boundary of the future 
of 7 would be compact if W were null geodesically complete. This is 
then shown to be incompatible with # being non-compact. 


Proof. The existence of a Cauchy surface implies that -@ is globally 
hyperbolic (proposition 6.6.3) and therefore causally simple (proposi- 
tion 6.6.1). This means that the boundary of J+(7) will be E+(7) and 
will be generated by null geodesic pemesielits which have past endpoints 
on Y and which are orthogonal to 7. Suppose -@ were null geo- 
desically complete. Then by aacitiade (1) and (3) and proposition 
4.4.6 there would be a point conjugate to Z along every future- 
directed null geodesic orthogonal to Z within an affine distance 2c-1 
where c is the value of ,,#,,9g% at the point where the null geodesic 
intersects 7. By proposition 4.5.14, points on such a null geodesic 
beyond the point conjugate to Z would lie in I+(7). Thus each 
generating segment of /+(7) would have a future endpoint at or 
before the point conjugate to 7. At 7 one could assign, in a con- 
tinuous manner, an affine parameter on each null geodesic orthogonal 
to 7. Consider the continuous map /: 7 x[0,b]xQ>.@ (Q is the 
discrete set 1, 2) defined by taking a point peJ an affine distance 
ve [0,6] along one or other of the two future-directed null geodesics 
through p orthogonal to 7 Since J is compact, there will be some 
minimum value c, of (—1%,,9°") and (—.%,9%). Then if by = 2c,)7!, 
B(T x [0,b9]x@) would contain /+(7). Thus J+(Z) would be 
compact being a closed subset of a compact set. This would be possible 
if the Cauchy surface # were compact because then J*(7) could 
meet up round the back and form a compact Cauchy surface homeo- 
morphic to # (figure 49). However there is clearly going to be trouble 
if one demands that # is non-compact. To show this rigorously one 
can use the fact (see §2.6) that admits a past-directed C1 timelike 
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Fricure 49. A two-dimensional section of a geodesically complete space with 
a compact Cauchy surface #. The two-sphere 7 has a compact boundary 
J+(7) to its future J+(7), as the outgoing null geodesics from YF meet up 
round the back of the cylinder. 


~ 


vector field. Each integral curve of this field will intersect # (as it is 
a Cauchy surface) and will intersect J+(7) at most once. Thus they 
will define a continuous one-to-one map a: J+(7)> #. If J+(7) were 
compact, its image «(J+(7)) would also be compact and would be 
homeomorphic to /+(:7). However as # is non-compact, a(J+(Z)) 
could not contain the whole of “ and would therefore have to have 
a boundary in #. This would be impossible since by proposition 6.3.1, 
J+(Z), and therefore a(J+(7)), would be a three-dimensional mani- 
fold (without boundary). This shows that the assumption that -@ is 
null geodesically complete (which we made in order to prove J+(7) 
compact) is incorrect. Oo 


Condition (1) of this theorem (that R,,A7A® > 0 for any null vector K) 
was discussed in $4.3. It will hold no matter what value the value of 
the constant A, provided that the energy density is positive for every 
observer. It will be shown in chapter 9 that condition (3) (that there is 
a closed trapped surface) should be satisfied in at least some region 
of space-time. This leaves condition (2) (that there is a non-compact 
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spacelike surface # which is a Cauchy surface) to be discussed. By 
proposition 6.4.9, the existence of spacelike surfaces is guaranteed 
provided one assumes stable causality. That the spacelike surface # 
be non-compact is not too serious a restriction since the only place it 
was used was to show that «(J+(Z)) could not be the whole of #. 
This could also be shown if, instead of taking ¥ to be non-compact, 
one required that there exist a future-directed inextendible curve 
from # which did not intersect J+(Z7). In other words, the theorem 
would still hold even if # were compact, provided there was some 
observer who could avoid falling into the collapsing star. This might 
not be possible if the whole universe were collapsing also, but in such 
a case one would expect singularities anyway as will be shown 
presently. The real weakness of the theorem is the requirement that 
H be a Cauchy surface. This was used in two places: first, to show that 
M was causally simple which implied that the generators of J+(7) 
had past endpoints on 7, and second, to ensure that under the map « 
every point of J+(.7) was mapped into a point of #. That the Cauchy 
surface condition is necessary is shown by an example due to Bardeen. 
This has the same global structure as the Reissner—-Nordstrom solution 
except that the real singularities at r = 0 have been smoothed out so 
that they are just the origins of polar coordinates. The space-time 
obeys the condition R,, K7K® > 0 for any null but not timelike vector 
K, and contains closed trapped surfaces. The only way in which it fails 
to satisfy the conditions of the theorem is that it does not have 
a Cauchy surface. 

It therefore seems that what the theorem tells us is that in a col- 
lapsing star there will occur either a singularity or a Cauchy horizon. 
This is a very important result since in either case our ability to pre- 
dict the future breaks down. However it does not answer the question 
of whether singularities occur in physically realistic solutions. To 
decide this we need a theorem which does not assume the existence of 
Cauchy surfaces. One of the conditions of such a theorem must be that 
Ri,KetK® > 0 for all timelike as well as null vectors, since failure to 
obey this condition is the only way in which Bardeen’s example is 
unreasonable. The theorem we shall give below requires this condition 
and also the chronology condition that there be no closed timelike 
curves. On the other hand it is applicable to a wider class of situations 
since the existence of a closed trapped surface is now only one of three 
possible conditions. One of these alternative conditions is that there 
should be a compact partial Cauchy surface, and the other is that there 
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Past light cone 


FicurE 50. A point p whose past light cone starts reconverging. 


should be a point whose past (or future) ight cone starts converging 
again (figure 50). The first of these other conditions is satisfied in 
a spatially closed solution while the second is closely related to the 
existence of a closed trapped surface but is in a form which is more 
convenient for some purposes; for in the case in which the light cone 
is our own past light cone, one can directly determine whether this 
condition is satisfied. In the last chapter it will be shown that recent 
observations of the microwave background indicate that it is. 
The precise statement is: 


Theorem 2 (Hawking and Penrose (1970)) 


Space-time (.#, §) is not timelike and null geodesically complete if: 

(1) R,,A*K® > 0 for every non-spacelike vector K (cf. $4.3). 

(2) The generic condition is satisfied (§4.4), i.e. every non-spacelike 
geodesic contains a point at which A, Ry ateA )K°K? + 0, where K is 
the tangent vector to the geodesic. 

(3) The chronology condition holds on .# (i.e. there are no closed 
timelike curves). 

(4) There exists at least one of the following: 

(i) a compact achronal set without edge, 
(ii) a closed trapped surface, 
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(ill) a point p such that on every past (or every future) null geodesic 
from p the divergence @ of the null geodesics from p becomes negative 
(i.e. the null geodesics from p are focussed by the matter or curvature 
and start to reconverge). 


Remark. An alternative version of the theorem is that the following 
three conditions cannot all hold: 

(a) every inextendible non-spacelike geodesic contains a pair of 
conjugate points; 

(6) the chronology condition holds on .#: 

(c) there is an achronal set .Y such that E+(S) or E-(S) is compact. 
(We shall say that such a set is, respectively, future trapped or past 
trapped). 

In fact it is this form of the theorem that we shall prove. The other 
version will then follow since if W were timelike and null geodesically 
complete, (1) and (2) would imply (a) by propositions 4.4.2 and 4.4.5, 
(3) is the same as (b), and (1) and (4) would imply (c), since in case (i) 
F would be the compact achronal set without edge and 


E+(S) = ES) =F; 


in cases (ii) and (iii) “ would be the closed trapped surface and the 
point p respectively, and by propositions 4.4.4, 4.4.6, 4.5.12 and 
4.5.14 H*(S) and E-(/) would be compact respectively, being the 
intersections of the closéd sets J+(SY) and J-(SY) with compact sets 
consisting of all the null geodesics of some finite length from . 


Proof. As the proof is rather long, we shall break it up by first estab- 
lishing a lemma and corollary. We note that by an argument similar 
to that of proposition 6.4.6, (a) and (6) imply that strong causality 
holds on -#@. 


Lemma 8.2.1 
If ¥ is a closed set and if the strong causality condition holds on 
J+(S) then H+(H+(f)) is non-compact or empty (figure 51). 


By lemma 6.3.2, through every point qedt(FS )-—¥ there is a past- 
directed null geodesic segment lying in J+(.%) which has a past end- 
point if and only if ge #t(S). (Note that as we no longer assume the 
existence of a Cauchy surface, .W may not be causally simple and so 
J+(S) —E+(S) may be non-empty.) Therefore if geJ +() — ES), 
there is a past-inextendible null geodesic through q which lies in J+(.Y) 
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Identify 


Identify 


H*(E*()) 


H+(E*(F)) 


Ficure 51. A future trapped set ; nul] lines are at + 45°, three lines have been 
identified and the points q are at infinity. The achronal sets E+(/), J+(S) and 
H+(E+(.Y)) are shown. A future-inextendible timelike curve ye D*(E7(f)) is 
shown. 


and so does not intersect [-(J+(S)). From lemma 6.6.4 it then follows 
that q is not in D+(J+(SA)) —H*+(J+(F)). Hence 


Dt(B+(S))—H+(E(S)) = D+(J+(F)) —H+(I +(F)) 
and H+(E+(f)) < H+(J+(P)). 


Now suppose that H+(#+(S)) was non-empty and compact. Then 
it could be covered by a finite number of local causality neighbour- 
hoods %,. Let p, be a point of J+(Y)n [W,— Dt(J+(F))]. Then from 
p, there would be a past-inextendible non-spacelike curve A, which did 
not intersect either J+(.S) or D+(E+(S)). Since the U, have compact 
closure, A, would leave Y,. Let q, be a point on A, not in Y,. Then since 
q,€J*(S) there would be a non-spacelike curve , from q, to Y. This 
curve would intersect D+(E+(S)) and hence would intersect some 7%, 
other than %, (say, Y,). Then let p, be a point of u, 9 [%,—D+(J+(SF))} 
and continue as before. 

This leads to a contradiction since there were only a finite number 
of the local causality neighbourhoods Y,, and one could not return to 
an earlier Y%; because no non-spacelike curve can intersect a Y; more 
than once. Thus H+(£+()) must be non-compact or empty. O 


Corollary 


If S is a future trapped set, there is a future-inextendible timelike 
curve y contained in Dt(EH+(f)). 
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Put a timelike vector field on -#. If every integral curve of this field 
which intersected H+(S) also intersected H+(H+(S)) they would 
define a continuous one-one mapping of E+(/) onto H+(E+(S)) and 
hence H*(E*(S)) would be compact. The intersection of I+(Y) with 
a curve which does not intersect H+(E+(S)) gives the desired curve y 
(figure 51 indicates one possible situation). O 


Now consider the compact set F defined as E+(S)n J-(y). Since 
y was contained in intI+(£+(S)), E-(F) would consist of ¥ and 
a portion of J~(y). Since y was future inextendible, the null geodesic 
segments generating /~(y) could have no future endpoints. But by (a) 
every inextendible non-spacelike geodesic contains a pair of conjugate 
points. Thus by proposition 4.5.12, the past-inextendible extension v’ 
of each generating segment v of J~-(y) would enter J~(y). There would 
be a past endpoint for v at or before the first point p of v’n I-(y). 
As I~(y) would be an open set, a neighbourhood of p would contain 
points in /~(y) on neighbouring null geodesics. Thus the affine distance 
of the points p from Y would be upper semi-continuous, and E-(F) 
would be compact being the intersection of the closed set J~(y) with 
a compact set generated by null geodesic segments from F of some 
bounded affine length. It would then follow from the lemma that 
there would be a past-inextendible timelike curve A contained in 
int D-(E-(F )) (figure 52). Let a,, be an infinite sequence of points on A 
such that: pies 

(I) Ay41€1-(a,), 

(II) no compact segment of A contains more than a finite number 
of the a,,. 


Let b,, be a similar sequence on y but with J* instead of J~ in (I) and 
with b, e€[*(a,). 

As y and A were contained in the globally hyperbolic set 
int D(E-(F )) (proposition 6.6.3), there would be a non-spacelike geo- 
desic u,, of maximum length between each a, and the corresponding 
b,, (proposition 6.7.1). Each would intersect the compact set Et(7). 
Thus there would be a ge Ht(S) which was a limit point of the 
fi, E*(F) and a non-spacelike direction at q which is a limit of the 
directions of the y,,. (The point g and the direction at q define a point 
of the bundle of directions over #. Such a limit point exists because 
the portion of the bundle over E*(S%) is compact.) Let py’, be a 
subsequence of the ,, such that yw’, E*(/) converges to q and such 
that the directions of the p’,, at H+(/) converge to the limit direction. 
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Ficure 52. As figure 51, but with three further lines identified. ¥ is the set 
E+(S)nJ-(y); the points p are past endpoints of null geodesic generating 
segments of E-(¥ ). The curve A is a past-inextendible timelike curve contained 
in int D-(E-(F)). 


(More precisely, the points defined by they’, in the bundle ef directions 
over E+(S) converge to the limit point.) Let « be the inextendible 
geodesic through qg in the limit direction. By (a) there would be 
conjugate points x and y on uz with yel*(x). Let x’ and y’ be on » to 
the past and future of x and y respectively. By proposition 4.5.8, there 
is some € > 0 and some timelike curve x from 2’ to y’ whose length 
is € plus the length of « from x’ to y’. Let Y and ¥ be convex normal 
coordinate neighbourhoods of x’ and y’ respectively, each of which 
contains no curve of length te. Let x” and y” be Wn a and Vp 
respectively. Let x’, and y’, be points on yw’, converging to 2’ and y’ 
respectively. For sufficiently large, the length x’, from x’, to y’,, will 
be less than je plus the length of u from a’ to y’. Also for n sufficiently 
large, x’, and y’,, would be in I-(x”, ZY) and I+(y",W) respectively. 
Then going from x’, to x”, along a to y”, and from y” to y’,, would 
give a longer non-spacelike curve than yw’, from 2’, to y’,. But by 
property (II), a’,, would lie to the past of x’, on yw’, and b’,, would lie 
to the future of y’,, on y’,,, for n large enough. Therefore y’,, ought to 
be the longest non-spacelike curve from 2’, to y’,,. This establishes the 
desired contradiction. Oo 
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While this theorem establishes the existence of singularities under 
very general conditions, it has the disadvantage of not showing 
whether the singularity is in the future or the past. In case (ii) of 
condition (4), when there is a compact spacelike surface, one has no 
reason to believe that it should be in the future rather than in the past, 
but in case (i) when there is a closed trapped surface, one would expect 
the singularity to be in the future, and in case (iii) when the past null 
cone starts reconverging, one would expect the singularity to be in the 
past. One can show that there is a singularity in the past if condition 
(ili) is strengthened somewhat to say that all past-directed timelike 
as well as null geodesics from p start to reconverge within a compact 
region in J~(p). 


Theorem 3 (Hawking (1967)) 
If (1) R,, K7K° > 0 for every non-spacelike vector K (cf. §4.3); 

(2) the strong causality condition holds on (4, §); 

(3) there is some past-directed unit timelike vector W at a point p 
and a positive constant b such that if V is the unit tangent vector to 
the past-directed timelike geodesics through p, then on each such 
geodesic the expansion 6 = V°., of these geodesics becomes less than 
— 3c/b within a distance b/c from p, where c = — W°Y,, 
then there is a past incomplete non-spacelike geodesic through p. 


Let K* be the parallelly propagated tangent vector to the past- 
directed non-spacelike geodesics through p, normalized by K*W, = —1. 
Then for the timelike geodesics through p, A*=c1V% and so 
K*,, =cV%,,. Since K?,, is continuous on the non-spacelike geo- 
dlasies: it will — less ‘shoes — 3/b on the null geodesics through p 
within an affine distance b. If Y,, Y,, Y,; and Y, are a pseudo-ortho- 
normal tetrad on these null geodesics with Y, and Y, spacelike unit 
vectors and Y, and Y, null with Y,7Y,, = —1 and Y, = K, the expan- 
sion @ of the null geodesics through p is defined as 


6 a 1 nO Oe ee a Y,* ae?) 
= [Qeeee Oy OLA Ry ean te). 
The second term is zero because Kis parallelly propagated. The third 
term can be expressed as 4(K,K%).,Y3°, which is less than zero as 
K_,K* is zero on the null geodesics and negative for timelike geodesics. 


This shows that 6 will become less than — 3/6 within an affine distance b 
along each null geodesic from p. Thus if all past-directed null geodesics 
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from p were complete, E~(p) would be compact. Any point 
qeJ-(E~(p))—E~(p) would be in I~(p). Thus it could not be in 
J+(E-(p)) since H-(p) is achronal. Therefore 
J*(E-(p))n J-(E-(p)) = B(p) 

and so would be compact. Then by proposition 6.6.7, D-(E~(p)) would 
be globally hyperbolic. By proposition 6.7.1, each point re D-(E-(p)) 
would be joined to p by a non-spacelike geodesic which did not contain 
any point conjugate to p between r and p. Thus by proposition 4.4.1, 
D-(E-(p)) would be contained in exp, (f) where F' is the compact 
region of 7, consisting of all past-directed non-spacelike vectors ke 
such that K¢*W, < — 2b. If all past non-spacelike geodesics from p were 
complete, exp, (K*) would be defined for every K*¢F’, and so exp, (Ff) 
would be compact being the image of a compact set under a continuous 
map. However by the corollary to lemma 8.2.1, D-(£~(p)) contains 
a past-inextendible timelike curve. By proposition 6.4.7 this could not 
be totally imprisoned in the compact set exp,(F), therefore the 
assumption that all past-directed non-spacelike geodesics from p are 
complete must be false. Oo 


Theorems 2 and 3 are the most useful theorems on singularities since 
it can be shown that their conditions are satisfied in a number of 
physical situations (see next chapter). However it might bathat what 
occurred was not a singularity but a closed timelike curve, violating 
the causality conditions. This would be much worse than the mere 
breakdown of prediction which was the alternative after theorem 1, 
and it is our personal opinion that it would be physically more objec- 
tionable than a singularity. Nevertheless one would like to know 
whether such causality violations would prevent the occurrence of 
singularities. The following theorem shows that they cannot in certain 
situations. This means that we have to take singularities seriously and 
it gives us confidence that, in general, causality breakdowns are not 
the way out. 


Theorem 4 (Hawking (1967)) 
Space-time is not timelike geodesically complete if: 
(1) K,,A*K° > 0 for every non-spacelike vector K (ef. § 4.3); 


(2) there exists a compact spacelike three-surface Y (without 
edge); 


(3) the unit normals to Y are everywhere converging (or every- 
where diverging) on /. 
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Remarks. Condition (2) may be interpreted as saying that the universe 
is spatially closed and condition (3) as saying that it is contracting 
(or expanding). As explained in §6.5 one may take a covering manifold 
M in which each connected component of the i image of S is diffeo- 
morphicto ¥ and isa partial Cauchy surface in A We shall work in.# 
and shall denote by one connected component ¢ of the image of /. 
Considering the Cauchy evolution problem in ./ one sees that the 
occurrence of singularities (though not necessarily their nature) is a 
stable property of the Cauchy data on F since a sufficiently small 
variation of the data on F will not violate condition (3). This is a 
counterexample to the conjecture by Lifshitz and Khalatnikov that 
singularities occur only for a set of Cauchy data of measure zero, 
though it must be remembered that the definition of a singularity 
adopted here is not that used by Lifshitz and Khalatnikov 


Proof. By conditions (2) and (3) the contraction y%, of the second 
fundamental form of ¥ has a negative upper bound on ¥. Thus if W 
(and hence .@) was timelike geodesically complete there would be 
a point conjugate to F on every future-directed geodesic orthogonal 
to F within a finite upper bound 6 of distance from (proposition 
4.4.3). But by the corollary to proposition 6.7.1, to every point 
qe D+(F) there is a future-directed geodesic orthogonal to f which 
does not contain any point conjugate to SF between F and q. Let 
B: F x [0, b] +.é be the differentiable map which takes a point peS 
a distance s€[0,6] up the future-directed geodesic through p ortho- 
gonal to Y. Then KMS x [0,5]) would be compact and would contain 
D+(f). Thus D+(F) and hence H+(S) would be compact. If one 
assumed the strong causality condition the desired contradiction 
would follow from lemma 8.2.1. However even without strong 
causality one can obtain a contradiction. Consider a point ge H +(F). 
Since every past-directed non-spacelike curve from q to SF would 
consist of a (possibly zero) null geodesic segment in H+(f) and then 
a non-spacelike curve in D+(f), it follows that d(F, q) would be less 
than or equal to b. Thus, as d is lower semi-continuous, one could find 
an infinite sequence of points r,€ DUP ) converging to q such that 
US, n) converged to UP, q): To each 7, there would correspond at 
least one element f-(r,,) of F x [0, b]. Since F x [0, b] is compact there 
would be an element i s) which was a limit point of the f~1(r,,). By 
continuity s = d(f,q) and f(p,s) = g. Thus to every point qe 7) 
there would be a timelike geodesic of length d( S,q) from S. Now let 
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q,eH lS ) lie to the past of g on the same null geodesic generator A of 
H+(f). Joining the geodesic of length d(f,q,) from F to q, to the 
segment of A between qg, and g, one would obtain a non- -spacelike curve 
of length d(F, q,) from F to q which could be varied to give a longer 
curve between these endpoints (proposition 4.5.10). Thus d(F,q), 

qeH a ), would strictly decrease along every past-directed generator 
of H+(f). But by proposition 6.5.2, such generators could have no 
past endpoints. This leads to a contradiction since as d(f, 9) is lower 
semi-continuous in g, it would have a minimum on the compact 


set H+(F). Oo 


Condition (2) that Y is compact is necessary, since in Minkowski space 
(@,1) the non-compact surface S: (x1)? + (x?)?+ (x3)? (x4)? = —1, 
x* < 0, is a partial Cauchy surface with v7, = — 3 at all points. If one 
took the region of Minkowski space defined by 


at< 0, (21)? 4+ (27 eyr ae), 


one could identify points under a discrete group of isometries G such 
that £/G was compact (Lobell (1931)). As required by theorem 4, the 
space (.4/G', n) would be timelike geodesically incomplete because one 
could not extend the identification under G to the whole of -@ (neither 
conditions (1) nor (2) of §5.8 would hold at the origin). In this case the 
incompleteness singularity arises from bad global properties and is not 
accompanied by a curvature singularity. This example was suggested 
by Penrose. 
Conditions (2) and (3) can be replaced by: 


(2’) Pisa Cauchy surface for AM: 
(3’) y%, is bounded away from zero on ; 


since in this case there cannot be a Cauchy horizon, yet all the future- 
directed timelike curves from Y must have lengths less than some 
finite upper bound. 

Geroch (1966) has shown that if condition (2) holds, and if conditions 
(1) and (3) are replaced by: 

(1”) A» A*K°? > 0 for every non-spacelike vector, equality holding 
only if R., = 

(3”) there isa ane peF such that any inextendible non-spacelike 
curve which intersects S also intersects both J+(p) and J~(p); 


then either the Cauchy deveWpitiens of F is flat, or @ is timelike 
geodesically incomplete. 
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Condition (3”) requires that an observer at p can see, and be seen by, 
every particle that intersects Y. The method of proof is to consider all 
spacelike surfaces without edge which contain p. One can form a 
topological space S(p) out of all these surfaces, in a manner analogous 
to that in which one forms a topological space out of all the non- 
spacelike curves between two points. Conditions (2) and (3”) then 
imply that S(p) is compact. One can show that the area of the surfaces 
is an upper semi-continuous function on S(p) and so there will be some 
surface “’ through p which has an area greater than or equal to that 
of any other surface. By a variation argument similar to that used for 
non-spacelike curves, one can show that y*, vanishes everywhere on 
SF’ except possibly at p, where the surface may not be differentiable. 

Consider a one-parameter family of spacelike surfaces S(u) where 
S(0) = F“'. The variation vector W = é/éu can be expressed as fn 
where n is the unit normal to the surfaces and f is some function. One 
can apply the Raychaudhuri equation to the congruence of integral 
curves of W to show 


d0/du = f{-—46?— 20? R,, nn? + ff, gy h™}, 
where O= Xa Far =Xav—$ hay, hav = Yan t+ Ma 
and t=... 04 


If there is some point ge S’ at which R,, n°’ + 0 or Xp + O one can 
find an f such that @0/éu is negative everywhere on S’. If R,,nm? and 
Xap Were zero everywhere on “’, but there was some point q on #” at 
which C,,,,.4n’n? was not equal to zero, then éa/@u + 0 and one could 
find an f such that @0/éu = 0 and @?6/éu? < 0 everywhere on #”. In 
either case, one would obtain a surface Y” on which yx%, < 0 every- 
where, and so .@ would be timelike geodesically incomplete by 
theorem 4. If Ry: Xap and C,y,qn°n? were zero everywhere on #”, then 
the Ricci identities for n* show that C,,.¢ = 0on /’. Hence space-time 
is flat in D(Y). An example in which conditions (1”), (2) and (3”) hold 
and in which D(S) is flat is Minkowski space with {a1, x, 2°, at} 
identifiedwith{¢!4-1, 22,22, 2}, {01,27 +1, 2°, Pp eanid (ae? dye}. 
This is geodesically complete. However the example given previously 
also satisfies these conditions and shows that D(.Y) can be both 
geodesically incomplete and fiat. 


276 SPACE-TIME SINGULARITIES [8.3 


8.3 The description of singularities 


The preceding theorems prove the occurrence of singularities in a large 
class of solutions but give little information as to their nature. To 
investigate this in more detail, one would need to define what one 
meant by the size, shape, location and so on ofa singularity. This would 
be fairly easy if the singular points were included in the space-time 
manifold. However it would be impossible to determine the manifold 
structure at such points by physical measurements. In fact there 
would be many manifold structures which agreed for the non-singular 
regions but which differed for the singular points. For example, the 
manifold at the ¢ = 0 singularity in the Robertson—Walker solutions 
could be that described by the coordinates 


{t, r cos 8, r sin # cos ¢, rsin O sin ¢} 
or that described by 
{t, Sr cos 6, Srsin 0 cos ¢, Sr sin 6 sin ¢}. 


In the first case the singularity would be a three-surface, in the second 
case a single point. 

What is needed is a prescription for attaching some sort of boundary 
@ to @ which is uniquely determined by measurements at non- 
singular points, i.e. by the structure of (.W, 8). One wouldthen like to 
define at least a topology, and possibly a differentiable structure and 
metric, on the space #* = WU @. One possibility would be to use the 
method of indecomposable infinity sets described in §6.8. However 
since this depends only on the conformal metric, it does not distinguish 
between infinity and singular points at a finite distance. To make this 
distinction it would seem one should base one’s construction for + 
on the criterion that has been adopted for the existence of a singularity : 
namely b-incompleteness. An elegant way of doing this has been 
developed by Schmidt. This supersedes earlier constructions by 
Hawking (1966) and Geroch (1968a) which defined the singular 
points as equivalence classes of incomplete geodesics. These construc- 
tions did not necessarily provide endpoints for all b-incomplete curves, 
such as incomplete timelike curves of bounded acceleration. There was 
also a certain ambiguity in their definition of equivalence classes. 
Schmidt’s construction does not suffer from these weaknesses. 

Schmidt’s procedure is to define a positive definite metric e on the 
bundle of orthonormal frames 7: O(M)-> M. Here O(.M) is the set of 
all orthonormal four-tuples of vectors {E,}, E,€7, for each pew 
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(a ranges from 1 to 4), and 7 is the projection which maps a basis at 
a point p to the point p. It turns out that O(.#) is m-incomplete in 
the metric e if and only if -@ is b-incomplete. If O(-#) is m-incomplete, 


one can form the metric space completion O(./) of O(.#) by Cauchy 


sequences. The projection 7 can be extended to O(.#), and the 


quotient of O(.@) by zis defined to be -@* which is the union of 4 with 
a set of additional points 0. The set @ consists of the singular points 
of -@ in the sense that it is the set of endpoints for every b-incomplete 
curve in 4. 

To perform this construction, we recall (§ 2.9) that the connection on 
M given by the metric § defines a four-dimensional horizontal subspace 

, of the ten-dimensional tangent space J, at the point we O(.4). 
Then 7, is the direct sum of H,, and the vertical subspace V, consisting 
of all the vectors in 7, which are tangent to the fibre 7—1(7(u)). We now 
construct a basis {G_,} = {E,, F,;} for T,, where A runs from 1 to 10, 
aruns from 1 to 4 and 7 runs from 1 to 6; {E,} is a basis for H,,, and 
{F,} is a basis for V,. 

Given any vector X€7,,,)(-@) there is a unique vector X €H,(O(¢M)) 
such that 7, X = X. Thus on O(-@) there are four uniquely defined 
horizontal vector fields E,, which are the horizontal lifts of the ortho- 
normal basis vectors E, for each point ue O(.#). The integral curves of 
the field E, in O(.M) represent parallel propagation of the basis {E,} 
along the geodesic in Win the direction of the vector E,. 

The group O(3, 1), the multiplicative group of all non-singular 4 x 4 
real Lorentz matrices A,,, acts in the fibres of O(.@) sending a point 
u = {p,E,}<¢O() to the point A(u) = {p, A, E,}eO(#). One can 
regard O(3, 1) as a six-dimensional manifold and represent the tangent 
space 7'(O(3, 1)) to O(3, 1) at the unit matrix J by the vector space of 
all 4 x 4 matrices a such that a,,G,. = — Gp pq. Then if ae T;(O(3, 1)), 
one can define a curve in O(3, 1) by A; = exp (ta) where 

co fn 
exp(6)= & 


nao! 


Thus if ~e€O(.#) one can define a curve through u in m'(m(w)) by 
Ant) = A;(u). As the curve A,,(¢) lies in the fibre, its tangent vector 
(d/ét),,,, is vertical. For each ae T,, one can therefore define a vertical 
vector field F(a) by F(a)|,, = (@/ét),,,|, for each weO(.@). If {a,} 
(i = 1,2,...,6) are a basis for T;, then F; = F(a;) will be six vertical 
vector fields on O(.@) which will provide a basis for V,, at each point 
UucO(M). 


10 HLS 
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A matrix Be 0(3, 1) defines a mapping O(.4)>O(4) by u> Blu). 
Under the induced map B,: 7,>T pi, the vertical and horizontal 
vector fields transform as follows: 


B,(E,) = BoE,, 
B,(F,;) = C7 F;, 


where Ci = Bop @ipe Boheq @ qq and {a5} are the basis for T*; dual to the 
basis {a,} for Tr(thus a/y,@jay = Oj, Cup Vea = 29 ac %va)- The property 
of these induced maps which will be important for what follows is not 
their actual form but the fact that they are constant over O(-4). 

One now has a basis {G,} = {E,, F,} (A = 1,..., 10) for T, at each 
point weO(-@). One can thus define a positive definite metric e on 
O(.M) by e(X, Y) = 5 X4Y4 where X, YeT(u) and X4, Y4 are the 

A 
components of X, Y respectively in the basis {G4}. 

Using the metric e, one can define a distance function p(u, v), 
u,veEO(m), as the greatest lower bound of lengths (measured by e) 
of curves from u to v. One can then ask whether O(.#) is m-complete 
with the distance function p. 


Proposition 8.3.1 
(O( 4), e) is m-complete if and only if (.W, 8) is b-complete. 


Suppose y(t) is a curve in #. Then given a point uem—1(p) where pey 
one can construct a horizontal curve y(t) through uw such that 
m(y(t)) = y(t). From the definition of the positive definite metric e, it 
follows that the arc-length of y(t) as measured in this metric is equal 
to the generalized affine parameter of y(t), defined by the basis at p 
represented by the point w. If therefore y(t) has no endpoint but has 
finite length as measured by the generalized affine parameter, then 
y(t) will also have no endpoint but will have finite length in the 
metric e. Thus m-completeness in O(-#) implies b-completeness in 4. 

To prove the converse, one needs to show that if A(t) is a C) curve in 
O(.#) of finite length without endpoint, then 7(A(t)) isa C! curve in # 
with 

(1) finite affine length, 

(2) no endpoint in #. 


To prove (1), one proceeds as follows. Let weA(t). Then one can 
construct a horizontal curve A(t) through u such that 7(A(t)) = m(A(#)). 
For each value of t, A(t) and A(t) will lie in the same fibre, so there will 
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be a unique curve A(t) in O(3,1) such that A(t) = Bit) A(t). This 
implies 3 P 

Bah = ce? Rot 

(a), = 2 (G), Pee, 


where B’ = dB/dt. Therefore 


@ e 2 ; , , 
(a), (3) }- = = (Es (5), 22) + E(B ay BM Mca)”: 


where {E“} is the basis of H*, dual to the basis {E,} (i.e. (E¢, E,) = 6%) 
and a’,, is the basis of 7;* dual to the basis a; ,, (i-€. @; 4, = 6). 
The matrix B,, satisfies B,, G,,.Ba. = Gaq- Therefore 


Bay Gee Bea = Goa 
as G,, = G-1,,. Differentiating with respect to ¢, one has 
B py B40 Gea = — Gace B ay B ne. 


Thus B',, 6-1,,€ T,(O(3, 1)). Since the a#,, are a basis for 7'*,, there is 
some constant C such that 


p> (Bia 12 ie Weal 2 C(B a Be Bai Bee: 


Any matrix BeO(3,1) can be expressed in the form B = QAQ, 
where (i) Q and Q are = matrices of the form 


6 0 
(— =) gue = —) 
where O and O are 3 x 3 orthogonal matrices, and the basis {E,} has 


been numbered so that E, is the timelike vector; these matrices 
represent rotations; and (ii) A is the matrix 


cosh 0 O sinh& 
0 iO 0 
eee) 

sinhé 0 0 e@shZ 


which represents a change of velocity in the 1-direction. With this 
decomposition, B’ 4, By, Bag Bolge > AE)?. 
For any vector Xe 7, 


> (CEs, X) On)? = 2 (<E*, x): 
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m3 ( (G2) > 2 (CE (@))) 
“(8 6) 

Therefore 

(3) -(8)) >a) @) Jere 
and so 

4 
(a), (@),)) > ale(@y GJ ero 


Let &, < oo be the least upper bound for |é| on A(é). Then 
L(A) > $L(A) eo + CRE, 


where L(A) is the length of the curve A in the metric e. Since this is 
finite, £; and L(A) must be finite. Thus the affine length of the curve 
7m(A(#)) in @, which is equal to L(A), will be finite. 

To complete the proof of proposition 8.3.1, we have to show that the 
curve 7(A(¢)) in-# has no endpoint, that is, we have to show that there 
is no point pe-# such that m(A(t)) enters and remains within every 
neighbourhood @ of p. Because of the existence of normal neighbour- 
hoods & of p, this is a consequence of the following resylt: 


Proposition 8.3.2 (Schmidt (1972)) 


Let V bea compact subset of “#. Suppose there is a curve A(t) in O(.4) 
without endpoint and of finite length, which enters and remains 


within 7~1(.4’). Then there is an inextendible null geodesic y contained 
in. 


Let A(t) be the horizontal curve through some point weA(t) such 
that m(A(t)) = m(A(t)). The curve A(t) has no endpoint. Suppose 
there were a point veO(.@) which was an endpoint of the hori- 
zontal curve A(t). Then there would be an open neighbourhood W of v 
with compact closure such that A(t) entered and remained within W. 


Let W” be the set {xeO(): Bre W for all matrices B with |£| < £,}. 
Since W was compact and &, is finite. W’ would be compact. The 
curve A(t) would enter and remain within ¥”’. But any compact set 
is m-complete with respect to the positive definite metric e. Thus 


A(t), having finite length, would have an endpoint in YW’. This shows 
that A(t) has no endpoint. 
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Let {x,} be a sequence of points on A(t) without any limit point. 
Since. is Compact, there will be a point x¢.V which is a limit point 
of 7(x,,). Let Y be a normal neighbourhood of x with compact closure, 
and let 7: VY O(.#) bea cross-section of O(. 4) over %, i.e. a(p), pE%, 
is an orthonormal basis at p. Let A(t) = o(m(A(t))) for A(t)en-"(%). 
Then as in the previous proposition, there will be a unique family of 
matrices A(t)€O(3, 1) such that A(t) = A(é) A(t), and one can express 
the matrix A in the form A = QAQ. Suppose that |£(¢,,)| had a finite 
upper bound &,, where z,,, = A(t,) is a subsequence of the x, which 
converges to x. Then the points x, would be contained in the set 
UW’ = {ve0O(#): Av < o(®) for some AeO(3,1) with |é| < &}. 
However Z would be compact and so would contain a limit point of 
the {z,,}, which is contrary to our choice of the {x,}. Thus |&(t,-)| has 
no finite upper bound. Since the orthogonal group is compact, one can 
choose a subsequence {7,-} such that Q,,. converges to some 1’, ©, 
converges to some 10’, €,-->0o, and 


Enrsi— bn > a> 0 (8.1) 


for some constant a (here Q,- = Q(t,), etc.). 

Let A’(t) = (Q')“ A(t), and let 4,,-(t) = A,-'U(Q’)-1A@). Then 4,,-(E,") 
tends to 2 = Q’a(z). Since the length of the curve A(t) is finite, the 
curve A’(t) also has finite length. This means that 


ee 2 
[0° eens Cent a+ (x) 


tends to zero, where 


X4 = (E4, (0/at),), A = u,», 2, 3, 

and ae 
tn” 41 

Thus { - |XA4| dé 


tends to zero, for each A. The components Y,,-4 of the tangent vector 
of the horizontal curve A,,.(t) are 


Y,.4 = ein X¥, ¥,0 = ein X9, VY 2= X?2, Y3 = X38, 


tn” +4 
Thus | “ly Aldé (4 =4,2,8), (8.2) 


tn” 


tend to zero. 
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Let « be the integral curve of the horizontal vector field E” through 2. 
Then 7() will be a null geodesic in 4. Suppose that m() left / in 
both the past and future directions. Then there would be some 
neighbourhood ¥ of # with compact closure and with the property 
that in each direction y left and did not re-enter the set VY’, where 
¥' ={veEO(#): there is a A with Av contained in 7}. One could 
choose ¥ sufficiently small that it had this property for any integral 
curve of E’ which intersected Y and so that any such curve would 
leave 7—1(/) in both directions. Let Y be the tube consisting of all 
points on integral curves of E* which intersect V. Then Yn a7—(.") 
would be compact. For sufficiently large n”, 4,,-(t,-) would be con- 
tained in YW. By (8.2) the components of the tangent vector to A,, 
transverse to the direction E” are so small that for large n” and 
t > t,», the curve A,,.(t) could not leave the tube Yn m4) except at 
its ends where ¥ left 7—1(.V). However 4,,-(t) cannot leave we A fas 
A(t) does not leave m(/). Thus A,,(t) would be contained in 
Yna (NV) for t>t,-. This leads to a contradiction as follows: 
Anrsiltgrs1) is contained in YW. However by (8.1), ¥ can be chosen 
sufficiently small that 


Age) = Ane Aye Anesaltnesa) 


is not contained in ¥, though it is contained in ¥~’. This shows that 
our assumption that the null geodesic m() left .{° in both directions 
is false. Thus there will be some point peé.{° which is a limit point of 
m(#). By lemma 6.2.1 there will be an inextendible null geodesic 
y through p which is contained in .f and which is a limit curve of 
71({t). O 


If O(-4) is m-incomplete, one can form the metric space completion 
O(4). This is defined to be the set of equivalence classes of Cauchy 
sequences of points in O(.@). If x = {x,} andy = {Ym} are Cauchy 
sequences in O(#), the distance p(x, y) between x and y is defined to 
be lim p(v,,.y,,) where p is the distance function on O(#) defined by 


no 
the positive definite metric e; 2 and y are said to be equivalent if 
p(x, y) = 0. One can decompose O(M) into a part homeomorphic to 
O(4) and a set of boundary points @ (ice. OF) = O(-4)uUé). The 
distance function fp defines a topology on O(-#). From (8.1), it follows 


that the topology on O(.4) is independent of the choice of basis 
{a,} of 77. 
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One can extend the action of O(3, 1) to O(.#). For under the action 
of A €O(3, 1), the transformation of the basis {G_,} is independent of 
position in O(.#). Thus there are positive constants C, and O, 
(depending only on A) such that C,p(u, v) < p(A(u), A(v)) < C,p(u, v). 
This means that under the action of A, Cauchy sequences will map to 
Cauchy sequences and equivalence classes of Cauchy sequences are 
mapped to equivalence classes of Cauchy sequences. Therefore the 
action of O(3, 1) extends to O(-#) in a unique way. One can then define 
M* to be the quotient of O(-#) by the action of O(3, 1). Since the 
quotient of O(-#) by O(3, 1) is @, and since O(3, 1) maps incomplete 
Cauchy sequences to incomplete Cauchy sequences, one can express 
M* asthe union of 4 and a set ¢ of points called the b-boundary of 4. 
One can regard points of ¢ as representing the endpoint of equivalence 
classes of b-incomplete curves in -#. 


The projection 7: O( #@) +-4*, which assigns a point in O(.#@) to its 
equivalence class under O(3, 1), induces a topology on -@* from the 
topology on O(.#). However 7 does not induce a distance function 
on .@* because p is not invariant under O(3, 1). Thus although the 


topology of O(-4) is a metric topology, and so Hausdorff, that of 4+ 
need not be Hausdorff. This means that there may be a point pe.W 
and a point g¢é such that every neighbourhood of p in .#@* intersects 
every neighbourhood of g. This happens when the point q corresponds 
to an incomplete curye which is totally or partially imprisoned in M. 
We shall discuss imprisoned incompleteness further in §8.5. 

If ¢ is a positive definite metric on 4, then M* is homeomorphic 
to the completion of (.W, &) by Cauchy sequences. Schmidt’s construc- 
tion also has the desirable property that if one cuts a closed set . out 
of a space, then one gets at least one point of the b-boundary for every 
point of 7° that is the endpoint of a curve in #—.~. An example 
where one gets more than one b-boundary point for a point of is 
provided by two-dimensional Minkowski space in which the set V is 
taken to be the t-axis between —1 and +1. Then there will be two 
b-boundary points for each point (0, t) where —1 < ¢ < 1. Anexample 
where a point in .° cannot be reached by a curve in @ —.@ is given 
by the set 

= t= sin ,¢+ obu{—1 eee or 


There is no curve in .# —.& which has an endpoint at the origin, and 
hence this point will not be in (#—.¥)*, although it isin &’. 
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Although Schmidt’s construction has an elegant formulation, it is 
unfortunately very difficult to apply in practice. The only solutions 
for which.#*+ has been found, apart from spaces of constant curvature, 
are the two-dimensional Robertson—Walker solutions with normal 
matter. In these @ turns out to be a spacelike one-surface as might be 
expected from the conformal picture. In this case, one can define a 
natural differential structure on @ and make .#* into a manifold with 
boundary. However there does not seem to be any general way of 
defining a manifold structure on @. Indeed one might expect that in 
generic situations @ would be highly irregular and could not be given 
a smooth structure. 


8.4 The character of the singularities 


In this and the following section we shall discuss the character of the 
singularities predicted by theorem 4. We consider this theorem rather 
than the others because more information about the singularity can 
be obtained. We expect however that the singularities predicted by 
the other theorems will have similar properties. 

First there is the question of how bad the breakdown of differenti- 
ability of the metric must be. The theorems of the previous section 
showed that space-time must be geodesically incomplete i## the metric 
was C®. The C® condition was necessary in order that the conjugate 
points and variation of arc-length should be well-defined: in other 
words, in order that solutions of the geodesic equation should depend 
differentiably on their initial position and direction. However one can 
talk about geodesic incompleteness provided that solutions of the 
geodesic equation are defined. They will exist if the metric is C1 and 
will be unique and depend continuously on initial position and direction 
if the metric is C?~ (i.e. if the connection is locally Lipschitz). In fact 
one can discuss b-incompleteness provided merely that the positive 
definite metric e on the bundle of frames O(.#) is defined almost every 
where and is locally bounded. This will be the case if the components 
I7,, of the connection are defined almost everywhere and are locally 
bounded, i.e. if the metric is C!-. 

It thus might appear that what the theorems indicate is not that 
the curvature becomes unboundedly large but merely that it has a 
discontinuity (i.e. the metric is C?- rather than C2). We shall show that 
this is not the case: under the conditions of theorem 4 space-time must 
be timelike geodesically incomplete (and hence b-incomplete) even if 
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the metric is only required to be C?-. The method of proof is to 
approximate the C?~ metric by a C? metric and to perform variation 
of arc-length in this metric. 

Suppose that space-time is defined to be inextendible with a C?- 
metric and that the conditions of theorem 4 are satisfied. The timelike 
convergence condition, Rh, A*K® > 0, is now required to hold ‘almost 
everywhere’ with the Ricci tensor defined by generalized derivatives. 
The only part of the proof of theorem 4 that does not hold in a C?- 
metric is where variation of arc-length is used to show that there can 
be no point peD+(F) such that d(F,p) > — 3/0), where 0, is the 
maximum value of y%, on Y. Thus if .@ were timelike geodesically 
complete there would be some such point p and a geodesic orthogonal 
to F of length d(F, p) from F to p. Let Y be an open set with compact 
closure which contains J-(p)n J+(S) and let e and & be C® positive 
definite and Lorentz metrics respectively. For any ¢ > 0 one could 
find a C” Lorentz metric g,@ such that on Y 

(1) |g. —9%| <6 

(2) (oI inl ae 

(3) |g..24| < C, where C is a constant depending on Y, e, 8 and g, 

(4) R,.,K*K® > —e|K2|? for any vector K such that g,,,K*K® > 0. 
(The g,2” may be constructed by covering Y by a finite number of 
local coordinate neighbourhoods (¥,,, ¢,), integrating the coordinate 
components of g%® with a suitable smoothing function p,(x) and 
summing with a partition of unity {y,}, ie. 


9,(q) = Sala) | g° (2) p_(ae—$,(q)) ate, 
a gu(W a) : 


where { pigs —1)) 

Property (1) implies that for sufficiently small values of ¢, p would 
be in D+(¥,g,) and J~(p, 8.) N J+(F,&,) would be contained in &. 
There would therefore be a geodesic y, in the metric g, from SF to p of 
length d,(f, p). Also \d.(P, p) -d(f,p)| would tend to zero as e> 0. 

By properties (1), (2) and (3), and the standard theorems on ordinary 
differential equations, as «+0 the tangent vector to a geodesic 
in the metric §, would tend to that of the geodesic in the metric 
¢ with the same initial position and direction. There would be 
some upper bound to | V4| on Un BF x [0, 2d(F,p))), where V“is the 
unit tangent vector to the geodesic orthogonal to in the metric g. 
Thus for any 6 > 0 there would be an ¢, > 0 such that for any € < ¢, 
Reap V2 V2 > — 6. We can now establish a contradiction by showing that 
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a sufficiently small variation of the energy condition will not prevent 
the occurrence of conjugate points in the metric g, within a distance 
less than d,(F, p). For the expansion 9, of the geodesics in the metric 
g. obeys the Raychaudhuri equation: 


d6,/ds ie 302 Ww 20°, = Rags V2 ae 


Thus d(0,-1)/ds > 4+ R.., V*V°O,*. Therefore if the initial value @,, 
were negative and 360,,-? were less than one, 6,~' would become zero 
within a distance 3/6(1—3060,-*) from Y. But 6,,> 6) as €> 0. This 
shows that for sufficiently small values of € there would be a conjugate 
point on every geodesic in the metric g, orthogonal to F within a 
distance less than d,(Y, p). Therefore .W must be timelike geodesically 
incomplete even if the metric is required only to be C?-. 

This result implies that if space-time is extended to try to continue 
the incomplete geodesics, the metric must fail to be Lorentzian or the 
curvature must be locally unbounded, i.e. there would be a curvature 
singularity. However even though the curvature were locally un- 
bounded, the metric might still be able to be interpreted as a distribu- 
tional solution of the Einstein equations provided that the volume 
integrals of the components of the curvature tensor over any compact 
region were finite. This would be the case if the metric were Lorentz, 
continuous and had square integrable first derivatives. In particular 
this would be true if the metric were Lorentz and C!~ (i.e. locally 
Lipschitz). Examples of such C1~ solutions include gravitational shock 
waves (where the curvature has a 6-function behaviour on a null three- 
surface, see, for example, Choquet-Bruhat (1968) and Penrose 
(1972a)); thin mass shells (where the curvature has a 6-function 
behaviour on a timelike three-surface, see, for example. Israel (1966)); 
and solutions containing pressure-free matter where the geodesic flow 
lines have two- or three-dimensional caustics (see Papapetrou and 
Hamoui (1967), Grischuk (1967)). Because of the non-linear depend- 
ence of the curvature on the metric one cannot necessarily approxi- 
mate a C?~ distributional solution by a C? metric which obeys the 
convergence condition at every point, or at least does not violate it 
by more than a small amount as in the case above (property (4)). 
However in all the examples given above one can. Indeed this is their 
physical justification: they are regarded as mathematical idealizations 
of C? or C® solutions which obey the convergence condition and in 
which the curvature is very large in a small region. One could apply 
the theorems of §8.2 to these C? solutions and prove the existence of 
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incomplete geodesics in them. This shows that the singularities pre- 
dicted cannot be just gravitational impulse waves or caustics of flow 
lines but must be more serious breakdowns of the metric. (Ordinary 
hydrodynamic shock waves involve only discontinuities of density 
and pressure and so can exist with a C?- metric.) Although we are not 
quite able to prove it we believe that the singularities must be such 
that the metric cannot be extended to be even a distributional solution 
of the Einstein equations, i.e. as well as the components of the curva- 
ture being unbounded at a singular point, their volume integral over 
any neighbourhood of sucha point must also be unbounded. This is so 
in all known examples of singularities other than the exceptional case 
of the Taub-NUT solution, which will be dealt with in the next section. 
If this conjecture is correct for ‘generic’ singularities (ie. except for 
those arising from a set of initial conditions of measure zero), then one 
can regard a singularity as a point where the Einstein equations (and 
presumably the other presently known laws of physics) break down. 

Another question one would like to answer is: how many incomplete 
geodesics are there? If there were only one, one might be tempted to 
feel that the singularity could be ignored. From the proof of theorem 4 
one can see that if there is no Cauchy horizon, ice. if Y is a Cauchy 
surface, then no timelike curve from # (geodesic or not) can be 
extended to a length greater than — 3/6, where 0, is the maximum 
value of v*, on /. In fact this result is true even if # is non-compact 
provided that y“, still has a negative upper bound. However this does 
not necessarily indicate that what happens is that every timelike curve 
hits the singularity. Rather it suggests that a singularity will be 
accompanied by a Cauchy horizon and so our ability to predict the 
future will break down. An example of this is shown in figure 53. Here 
the metric is singular at the point p and so this point has been removed 
from the space-time manifold. Spreading out from this hole there is 
a Cauchy horizon. This example shows that the most one can hope to 
prove is that there is a three-dimensional family of geodesics which 
are incomplete and which remain within the Cauchy development of 
F (in the example these are the geodesics which would pass through p). 
There may be other geodesics which leave the Cauchy development 
of FY and which are incomplete but one cannot predict their behaviour 
from knowledge of conditions on S. 

It is clear that there must be more than one incomplete geodesic in 
D+(f). For from theorem 4 it follows that there must be a geodesic y, 
orthogonal to SF, which remains in D+(S/) but which is incomplete. 
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PS) 
P 
? 


FIGuRE 53. The point p has been removed from space-time because a singularity 
occurs there. Consequently there is a Cauchy horizon H HS ) for the surface f. 


Let p be the point where y intersects “. Then one can make a small 
variation of S in a neighbourhood of p to obtain a new surface ¥’ for 
which y%, is still negative, but which is not orthogonal to x. Then by 
theorem 4 there must be some other timelike geodesic y’ orthogonal 
to F’ which is incomplete and which does not cross H+(.'), which is 
the same as H HS ip 

One can in fact prove that there is at least a three-dimensional 
family of timelike geodesics (one through each point of some achronal 
surface) which remain within D+() and which are incomplete. These 
geodesics all correspond to the same boundary point in the sense of the 
indecomposable past sets of §6.8, that is, they all have the same past. 
They may not, however, all correspond to the same points as defined 
by the construction of the previous section. An outline of the proof is 
as follows: in theorem 4 it was shown that there must be a future- 
directed timelike geodesic orthogonal to Y which cannot be extended 
to length 3/0). One can say more than this: there must be such a 
geodesic y which remains within D+(7) and is at each point a curve 
of maximum length from FY, i.e. for each qey, the length of y from F 
to gy equals d(., q). The idea is now to consider the function d(r, y) for 
re J~(y). Clearly this is bounded on J S )n J~(y). From the fact that 
y is a curve of maximum length from Y, it follows that ina neighbour- 
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hood of y, d(r, y) is continuous and the surfaces of constant d(r, y) are 
spacelike surfaces which intersect y orthogonally. The timelike geo- 
desics orthogonal to these surfaces will then remain within J~(y) and 
so will be incomplete. 


8.5 Imprisoned incompleteness 


In §8.1 we proposed b-incompleteness as a definition of a singularity. 
The idea was that a b-incomplete curve corresponded to a singular 
point which had been left out of space-time. However suppose that 
there is a b-incomplete curve A which has a limit point pé.4@, i.e. Ais 
partially or totally imprisoned in a compact neighbourhood of p. Then 
one cannot imbed .@ in a larger four-dimensional Hausdorff para- 
compact manifold .# such that A can be continued in-@’. For if g were 
the point where A intersected the boundary of -# in .-4@’, then any 
neighbourhood of g would intersect any neighbourhood of p, which 
would be impossible as #’ is Hausdorff and g + p. In fact, one can 
characterize imprisoned incompleteness of -4 by non-Hausdorff 
behaviour of the Schmidt completion .4*. 


Proposition 8.5.1 

A point pé.M is not Hausdorff separated in .@* from a point red if 
there is an incomplete curve A in @ which has 7 as a limit point and 
which has r as an endpoint in +. 


Suppose that p¢-@ is a limit point of a b-incomplete curve A. One can 
construct a horizontal lift A of A in the bundle of orthonormal frames 
O(.4). This will have an endpoint at some point 


xen-Ur) < d = O(M)—-O(4M). 


If Y is a neighbourhood of r in -@*+ then 7-\(¥ ) is an open neighbour- 
hood of in O(-@). Thus it contains all points on A beyond some pointy. 
Therefore all points on A beyond 7(y) will lie in V and hence ¥'will 
intersect any neighbourhood of p since p is a limit point of A. O 


Taub-NUT space (§5.8) is an example where there are incomplete 
geodesics which are all totally imprisoned in compact neighbourhoods 
of the past and future horizons U(t) = 0. As the metric is perfectly 
regular on these compact neighbourhoods, the incomplete geodesis. 
do not correspond to s.p. (scalar polynomial) curvature singularities. 
Consider a future incomplete closed null geodesic A(v) in the future 
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horizon U(t) = 0. Let p = A(0) and let », be the first positive value of v 
for which A(v) = p. Then as in §6.4, the parallelly propagated tangent 
vector to A will satisfy 

(2/20) |y—», = a(6/8v)| 0; 
where a > 1. For each n, the point A(v,,) = p, where 


Vp, = Vy 2 oe nes jL—@es ? 
and (2/20)|,-», = 2"(4/20)|,=0- 


Thus if one takes a pseudo-orthonormal parallelly propagated basis 
{E,} on A(v), where E, = é/év, then the other null basis vector E, obeys 
E5|,=1, = @ "Es3|,.9. Each time one goes round the closed null geo- 
desic A, the vector E, gets bigger and the vector E, gets smaller. The 
vectors E, and E, remain the same. If therefore there were some non- 
zero component of the Riemann tensor which involved E, and 
possibly E, and E,, it would appear bigger and bigger each time one 
went round A and so there would be a p.p. (parallelly propagated) 
curvature singularity. However in Taub—NUT space it turns out that 
the vector E, can be chosen so that there is only one independent non- 
zero component of the Riemann tensor, which is R(E,, E,, Es, E,). 
This involves E, and E, equally, and so has the same value each time 
round. Since a similar argument will probably hold for any imprisoned 
curve, it seems there is no p.p. curvature singularity in Taub-NUT 
space, although this space is singular by our definition. One would like 
to know whether this kind of behaviour would occur in physically 
realistic solutions containing matter, or whether Taub-NUT space is 
an isolated pathological example. This question is important because, 
as we shall argue in the next chapter, we interpret the preceding 
theorems as indicating not that geodesic incompleteness necessarily 
occurs, but that General Relativity breaks down in very strong gravita- 
tional fields. Such fields donot occurin the Taub—N UT kind of situation. 
This conclusion is a result of the very special nature of the Riemann 
tensor in Taub-NUT space. In general. one would expect some other 
components of the Riemann tensor to be non-zero on the imprisoned 
curve, and so there would be a p.p. curvature singularity even though 
there might be no s.p. curvature singularity. In fact one can prove: 


Proposition 8.5.2 


If pe is a limit point of a b-incomplete curve A and if at p, 
Fi, A*k® + 0 for all non-spacelike vectors K, then A corresponds to 
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a p-p. curvature singularity. (This condition can be replaced by the 
condition that there do not exist any null directions A@ such that 
K°K CpaK q = 0.) 


Let Y be a convex normal coordinate neighbourhood of p with com- 
pact closure, and let {Y,}, {Y*} be a field of dual orthonormal bases 
on %. Let {E,}, {E%} be a parallelly propagated dual orthonormal basis 
on the curve A(t). Let # be a parameter on A such that in %, 


di/dé = (X X*X*)t, 


where X* are the components of the tangent vector @/ét in the basis 
{Y,}. Then # measures arc-length in the positive definite metric on Y 
in which the bases {Y,}, {Y*} are orthonormal. 

Since R,, A“K® + 0 at p for any non-spacelike vector K¢, there is 
a neighbourhood ¥ < Y such that R,, = CZ,Z,+R'.,, where C + 0 
is a constant, Z, is a unit timelike vector, and R’,, is such that 
CR’, K*K° > 0 for any non-spacelike vector K*. Suppose that after 
some value f, of the curve A intersects Y. Since A has no endpoint 
and since p is a limit point of A, the part of A in WY will have infinite 
length as measured by f. However, the generalized affine parameter is 


given by du/d? = {> (H2, X44, 


where X? are the components of the tangent vector (0/ét),, so 
ps 2 ‘X! = 1, and E+, aré the components of the basis {E%} in the basis 


4 
{Y‘}. Since wu is finite on the curve, the modulus of the column vector 


E+, X* must go to zero, and so the Lorentz transformation represented 
by the components E%, must become unboundedly large. Since Z is 
a unit timelike vector, the components of Z in the basis {E,} will 
therefore become unboundedly large and hence some component of 
the Ricci tensor in the basis {E,} will become unboundedly large. O 


This result shows that an observer whose history was a b-incomplete 
imprisoned non-spacelike curve in a generic space-time would be torn 
apart by unboundedly large curvature forces in a finite time. However 
another observer could travel through the same region without experi- 
encing any such effects. An interesting example in this connection is 
provided by Taub-NUT space in which the metric has been altered 
by a conformal factor Q which differs from one only in a small neigh- 
bourhood of a point p on the horizon. This conformal transformation 
would not alter the causal structure of the space and would not affect 
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the incompleteness of the closed null curve through the point p. How- 
ever in general R,, K*K° + 0 where K@ is the tangent vector to the 
closed null geodesic. After each cycle, 2, K*K® increases by a factor a? 
and so there is a p.p. curvature singularity. Yet the metric is perfectly 
regular on a compact neighbourhood of the horizon and so there is 
no s.p. curvature singularity associated with the incompleteness. 

One would like to rule out this kind of situation in which the 
incomplete curves are totally imprisoned in a compact region. This . 
kind of behaviour might occur in a countably infinite number of 
different regions of space-time. Thus one cannot describe it by saying 
that all the incomplete curves are totally imprisoned in one compact 
set. Instead one wants to describe it by saying that a set of incomplete 
curves which are compact in some sense are totally imprisoned in a 
compact region of .4. To make this concept precise, we define 
b-boundedness as follows. 

We define the space B(-#) to be the set of all pairs (A, u), where u is 
a point in the bundle of linear frames L(-#) and A is a C! curve in @ 
which has only one endpoint, which is at m(u). Let Y be an open set 
in @ and ¥ be an open set in L(.#). We define the open set O(Y, ¥’) 
to be the set of allelements of B(.#) such that A intersects Yanduev. 
The sets of the form O(Y,V ) for all Y, ¥ form a sub-basis for the 
topology of B(.@). Recall that the map exp: T(.#)> Mi is defined by 
taking a vector X at a point p and proceeding along the geodesic 
from p in the direction of X a unit distance as measured in the 
affine parameter defined by X. Similarly we may define a map 
Exp: B(.4)—>- by proceeding from m(u) along the curve A a unit 
distance as measured in the generalized affine parameter on A defined 
by u. The map Exp is continuous and will be defined for all of B(A) 
if @ is b-complete. We shall say that (.#, 8) is b-bounded if for every 
compact set Wc B(.@), Exp(W) has a compact closure in 4. Since 
Exp is continuous, (.W, 8) is b-bounded if it is b-complete. However, 
Taub-NUT space is an example which is b-bounded but not b-com- 
plete. We shall show that this can be possible only because Taub-NUT 
space is completely empty. The presence of any matter on the surface 
F in theorem 4 will mean that the space is both b- -Incomplete and 
b-unbounded. 


Theorem 5 


Space-time isnot b-bounded if conditions (1 )-(3) of theorem 4 hold, and 
(4) the energy-momentum tensor is non-zero somewhere on £ 


3 
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(5) the energy-momentum tensor obeys a slightly stronger form of 
the dominant energy condition (§4.3): if K@ is a non-spacelike vector, 
then 7'°’K,, is zero or non-spacelike and T,,, K*K° > 0, equality holding 
Only if Tesh e="0: 


Remark. Condition (4) could be replaced by the generic condition 
(see Theorem 2). 


Proof. Consider the covering space , (§6.5) defined as the set of all 
pairs (p, 27[A]). where A is a curve from q to p, p, gE M, and iA] is the 
number of times A cuts ¥ in the future direction minus the number of 
times it cuts it in the past direction. For each integer a, 


S, = {(p, iA): pe, iA] = a} 
is diffeomorphic to Y and is a partial Cauchy surface in @,. In 


general .%,, need not be b-bounded if -@ is, but in the situation under 
consideration we have the following result: 


Lemma 8.5.3 

Let conditions (1)-(3) hold and let D+(.4) not have compact closure 
in .&,; then if % is the covering projection y: 4,>-M, (Dt(S)) 
will not have compact closure in -4. 


AM is either diffeomorphic to -4, or to -4,, the portion of 4, between 
Fand &., with K and’, identified. Tf for any a > 0,-4, Dt(S) 
does not have compact closure in 4%, then w(D*(.4)) will not have 
compact closure in 4. If however -4@, | D+(.%) had compact closure 
for all a > 0 it would also have to be non-empty for all a > 0 since 
D*(&) is non-compact. But for peX%, the proper volume of 
I~(p)n 4,_, has some lower bound c. Thus for every a > 0 the proper 
volume of M,  D+(.%) could not be less than c. But this is impossible 
since by conditions (1)-—(3 , _ proposition 6.7.1, the proper volume of 
D+(SH) is less than 3/(—@ ) x (area of Y), where @ is the negative 
upper bound of y“%, on in Oo 


Using this result, one can prove: 


Lemma 8.5.4 
If D+(%) does not have compact closure, -W is not b-bounded. 


Let W be the subset of B(-4,) consisting of all pairs (A, v) where A is 
any future-inextendible timelike geodesic curve in 4, orthogonal to 
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&, with endpoint re. %, and wem-\(r) is any basis at r, one of whose 
vectors is tangent to A and of length — 3/6), the remaining vectors 
being an orthonormal basis in %. 

Let {,} be a collection of open sets which cover W. Each &, will 
be the union of finite intersections of sets of the form O(W,V ). It is 
sufficient to consider the case when the Y can be represented as 


es = Nn O(Ugs% x)» 


where for each « the W,, are a —_ number of open sets in -@,, and 
¥, is an open set in L(-@,). Let (u,v) —eW. Then there is some « such 
that (uz, v)€FA,. This means that the geodesic u intersects the open set 
U,, for each value of f and that ve¥,. Since geodesics depend con- 
tinuously on their initial conditions there will be some neighbourhood 
Y, of m(v) such that every future-inextendible geodesic through Y, 
orthogonal to G will intersect Y,, for each value of #. Let VW", be an 
open set contained in V, such that 7(¥',) < Y,. Then 


(#, v)€O(m(¥",), Ya) 


is contained in Z,. Thus the sets {O(7(¥',), V’,)} form a refinement of 
the covering F,,. 

Consider the subset 2 of L(.W,) consisting of all bases over % 
where one of the basis vectors is orthogonal to SG and of length 
— 3/0), and the remaining vectors are an orthonormal basis of i. 
Since 2 is compact, it can be covered by a finite number of the sets 
¥",, Thus W is compact since it can be covered by a finite number of 
the sets O(7(V"',.), V',). 

By proposition 6.7.1 each point of D+(%) lies within a proper 
distance — 3/9, along the future-directed geodesic orthogonal to &. 
This means that Exp(W) contains D+(.%). Let y,: B( Mo) > B(M) 
be the map which takes (A, u)eB(M_) to (y(A), yu) € B(MW). Then 
YW will be a compact subset of B(.W) such that 


Exp (%W) > (D*(%)). 


Thus if D*(%) is not compact, y(D*(%)) is not compact. so (4, §) 
is not b-bounded. Oo 


This shows that it is sufficient to prove DHS ) non-compact. Suppose 
it were compact. Then H+(%) would also be compact. We show below 
that this would imply that the divergence of the null geodesic 
generators would have to be zero everywhere on H +(4). This would 
be impossible if the matter density were non zero somewhere on H ODE. 


8.5] IMPRISONED INCOMPLETENESS 295 


Lemma 8.5.5 


If H*(2) is compact for a partial Cauchy surface 2, then the null 
geodesic generating segments of H+(2) are geodesically complete in 
the past direction. 


From proposition 6.5.2 it follows that the generating segments have 
no past endpoints. They must therefore form ‘almost closed’ curves 
in the compact set H+(2). If they formed actual closed curves, one 
could use proposition 6.4.4 to show that if they were incomplete in the 
past direction, they could be varied towards the past to give closed 
timelike curves. This however would be impossible since such curves 
would lie in D*(2). The proof in the case when the null geodesic 
generators of H~(2) are only ‘almost closed’ is similar though a little 
more delicate. 

Introduce a future-directed timelike unit vector field V which is 
geodesic in a neighbourhood Y of H*(2) with compact closure. Define 
the positive definite metric 8’ as in proposition 6.4.4 by 


g'(X, iy = g(X, Y) ar 29(X, V) g(Y, V) 


and let t be a parameter which measures proper distance in the metric 
g’ along a null geodesic generating segment y of H*(2), and which is 
zero at some point gey. Then g(V, é/et) = — 2-2. As y has no past 
endpoint, ¢ will have no lower bound. Let f and h be given by 

“a D(a @_,8 


where v is an affine parameter. Suppose y were geodesically incomplete 
in the past, then the affine parameter 


t 
v= | h- dt’ 
0 


would have a lower bound v, as t->—0o. Now consider a variation 
a of y whose variation vector é/éu is equal to —xV. Then 


o (é a dx =) 

— — — 9-4 — pri 5 8.3 

a Va 5) ke (F er S3) 
Since h—>co as t->—00, one could find a bounded function x(t) such 
that (8.3) was negative for all ¢ < 0. However this would not be suffi- 


cient to ensure that the variation gave an everywhere timelike curve 
since it could be that the range of u for which (8.3) remained negative 
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tended to zero as t->—oo. To deal with this we shall consider the 
second derivative under the variation: 


a (a a\ @ a) 
au! \oe de} Bu \9 \ ae dt Ou 


a anal +o(s-2(Sa)z) 

s daw eo TINGE ot du du) 7a Vou’ H) au) 
Choosing éz/éu to be zero and using the fact that V is a geodesic in 
a neighbourhood Y of H*(2) this reduces to 


dx\? DV DV o rs) 
Peery (mses “2 aa ext 
a ae lo(F a? at See eh 5) ¥) | 


for 0 < u < «. In any basis orthonormal with respect to the metric 9’, 
the components of the Riemann tensor and of the covariant derivative 
of V (with respect to §) will be bounded on Y. Thus there is some 


C > 0 such that a @ 3 ae 0 a 
due? \oe oel = ~~ 2 \oe aE) 


7) a dx 
Now ma (v. *)) = di’ 
re) a(v. 5) =~ 24S, 7 
Therefore 
(a @ da 
Rae Ee ve 2 
(55) a(5x)+! (2x2) 05 ft 2 (G) 
<9(5. ara 
for 0 <u <e, where d = (2,/2)eC,+2e°0,/2+1, and C, is an upper 
bound o ldxfat|. Thus we have 
‘the 
oe 4 < C%x%(y +d) 
oy ae d 
and Fils oath oF (Ax), Yy|yno = 0, 


where y = g(@/0t, 0/0t). Therefore 
y < d(cosh Cru—1)+asinh Cru 
< sinh Cru(d tanh 4Cxru+a), 
where a = 2-#C0—'d (log hx) /dt. 
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Now take 9 -1 
x= te] —| bau + K| ; 
t 


0 
where K=2 i die 


then a = — 2-*C—ha. Since f = — h-\(dh/dt) is bounded on the compact 
set H*+(2) and since F 
i jai = = 
t 


was assumed to converge as t-> — 00, there would be upper bounds for 
x and |da/dt| and a positive lower bound C, for h when —c <¢ < 0. 
Then for 0 < u < min(e,2C-*d-1C,), y would be negative when 
ao <i < 0. 

In other words, the variation « would give a past-inextendible time- 
like curve which lay in int D*(2) and which was totally imprisoned in 
the compact set Y. But this is impossible, since by lemma 6.6.5 the 
strong causality condition holds on int D+(2). Thus y must be geo- 
desically complete in the past direction. Oo 


Consider the expansion @ of the tangent vectors @/ét to the null 
geodesic generators of H~(.A). Suppose that 9 > 0 at some point ¢g on 
a generator y and let 7 be a spacelike two-surface through g in a 
neighbourhood of g in H*(4%). The generators of H+(.4) will be 
orthogonal to F and w ould be converging into the past. Then by 
condition (1) and the above lemma there would be a point rey conju- 
gate to J along y (proposition 4.4.6). Points on y beyond r could be 
joined to 7 by timelike curves (proposition 4.5.14). But this would 
be impossible since H+(.%) is an achronal set. Therefore O< 0 on 
H*(S). 

Now consider the family of differentiable maps £,: H*(.%,) > H+(“) 
defined by taking a point g¢H+(S,) a distance z (measured in the 
metric $’) to the past along the null geodesic generator through gq. 
Let dA be the area measured inthe metric §’ of a small element of 


H+(S%,). Under the map £,, 


d 
q4 = —§dA. 
Thus = dA = -{ 6dA. (8.4) 
dz J pci* (2) BAH + (Po)) 


But £, maps H+(.4) into H*(%) (and onto if the generating segments 
have no future endpoints). Thus (8.4) must be less than or equal to 
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zero. Together with the previous result this would imply 0 = 0 on 
H+(&). By the propagation equation (4.35) this is possible only if 
R»K*K® = 0 everywhere on H+(.%), where K is the tangent vector to 
the null geodesic generator. However by the conservation theorem 
of §4.3 condition (5) implies that 7,,K7K® is non-zero somewhere 
on H+(f) and by the Einstein equations (with or without A), 7, K*K® 
equals R,,K*K®. (Strictly, the form of the conservation theorem 
required is slightly different from that in §4.3. Since there are no 
suitable spacelike surfaces which intersect H+(%), one uses instead 
a family of surfaces one of which is H*(.%), the others being spacelike. 
These surfaces can be defined by taking the value of the function ¢ at 
the point pe D+(K) to be minus the proper volume of J+(p)n D*+(G). 
Since ¢,, becomes null on H*(.4%), it is no longer necessarily true that 
there is a constant C > 0 such that on D+(%), 


TE 5 OTR. 
However if Vis a timelike vector field on Dagy there is a constantC 
such that Tt. 9 < OT(t. at. + teal) 
and. Tey, 4 < CLA. +t... V5); 


One can then proceed as in §4.3 using T(t. ,,+ V,. 2) in place of 
Tt, 4», and proving that T@(t,, ty +t,, 4) cannot be zéro on H+(%) 
if it is non-zero on .%. The result then follows from (5).) O 


a) 


Gravitational collapse and black holes 


In this chapter, we shall show that stars of more than about 14 times 
the solar mass should collapse when they have exhausted their 
nuclear fuel. If the initial conditions are not too asymmetric, the 
conditions of theorem 2 should be satisfied and so there should be 
a singularity. This singularity is however probably hidden from the 
view of an external observer who sees only a ‘black hole’ where the 
star once was. We derive a number of properties of such black holes, 
and show that they probably settle down finally to a Kerr solution. 

In § 9.1 we discuss stellar collapse, showing how one would expect 
a closed trapped surface to form around any sufficiently large spherical 
star at a late stage in its evolution. In §9.2 we discuss the event 
horizon which seems likely to form around such a collapsing body. 
In §9.3 we consider the final stationary state to which the solution 
outside the horizon settles down. This seems to be likely to be one of 
the Kerr family of solutions. Assuming that this is the case, one can 
place certain limits on the amount of energy which can be extracted 
from such solutions, 

For further reading on black holes, see the 1972 Les Houches 
summer school proceedings, edited by B. S. de Witt, to be published 
by Gordon and Breach. 


9.1 Stellar collapse 


Outside a static spherically symmetric body such as a star, the solution 
of Einstein’s equations is necessarily that part of one of the asymp- 
totically flat regions of the Schwarzschild solution for which r is 
greater than some value r corresponding to the surface of the star. 
This will be joined, for r < rg, onto a solution which depends in detail 
on the radial distribution of density and pressure in the star. In fact 
even if the star is not static, providing it remains spherically symmetric 
the solution outside will still be part of the Schwarzschild solution cut 
off by the surface of the star. (This is Birkhoff’s theorem, proof of 
which is given in appendix B.) If the star is static then 7) must be 
[ 299 ] 
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r< 2m r= 2m r > 2m 


Schwarzschild empty 
solution (exterior solution) 
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Surface of fluid 
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polar 
coordinates 
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r = 0 (singularity) 
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(coordinate 
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I 
| 
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(ii) 
Ficure 54. Collapse of a spherical star. 
(i) Finkelstein diagram ((r,t) plane) of a collapsing spherically symmetric 
fluid baJl. Each point represents a two-sphere. 
(ii) Penrose diagram of the collapsing fluid ball. 
(ii) Diagram of the collapse with only one spatial dimension suppressed. 


greater than 2m (the “Schwarzschild radius’). This follows because 
the surface of a static star must correspond to the orbit of a timelike 
Killing vector, and in the Schwarzschild solution there is a timelike 
Killing vector only where r > 2m. If r) were less than 2m, the surface 
of the star would be expanding or contracting. To get an idea of the 
magnitude of the Schwarzschild radius, we note that the Schwarz- 
schild radius of the earth is 1.0 em and that of the sun is 3.0 Km; 
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the ratios of the Schwarzschild radius to the radius of the earth and 
the sun are 7x 10-!° and 2x 10-* respectively. Thus normal stars 
are a long way from their Schwarzschild radii. 

The life of a typical star will consist of a long (~ 10° years) quasi- 
static phase in which it is burning nuclear fuel and supporting itself 
against gravity by thermal and radiation pressure. However when the 
nuclear fuel is exhausted, the star will cool, the pressure will be 
reduced, and so it will contract. Now suppose that this contraction 
cannot be halted by the pressure before the radius becomes less than 
the Schwarzschild radius (we shall see below that this seems likely for 
stars of greater than a certain mass). Then since the solution outside 
the star is the Schwarzschild solution, there will be a closed trapped 
surface 7 around the star (see figure 54), and so, by theorem 2, a 
singularity will occur provided that causality is not violated and the 
appropriate energy condition holds. Of course in this case, because the 
exterior solution is the Schwarzschild solution, it is obvious (see 
figure 54) that there must be a singularity. However the point is that 
even if the star is not exactly spherically symmetric, a closed trapped 
surface will still occur providing the departures from spherical sym- 
metry are not too great. This follows from the stability of the Cauchy 
development proved in §7.5; for one can regard the solution as 
developing from a partial Cauchy surface # (figure 55). Now if one 
changes the initial data by a sufficiently small amount on the compact 
region J (7) #, the new development of # will still be sufficiently 
near the old in the compact region J+(#) 1 J~(7) that there will still 
be a closed trapped surface around the star in the perturbed solution. 
Thus we have shown that there is a non-zero measure set of initial 
conditions which lead to a closed trapped surface and hence to a singu- 
larity by theorem 2. 

The two principal reasons why a star may depart from spherical 
symmetry are that it may be rotating or may have a magnetic field. 
One may get some idea of how large the rotation may be without 
preventing the occurrence of a trapped surface by considering the Kerr 
solution. This solution can be thought of as representing the exterior 
solution for a body with mass m and angular momentum L = am. 
If a is less than m there are closed trapped surfaces, but if a is gréater 
than m they do not occur. Thus one might expect that if the angular 
momentum of the star were greater than the square of its mass, it 
would be able to halt the contraction of the star before a closed trapped 
surface developed. Another way of seeing this is that if L = m? and 
angular momentum is conserved during the collapse, then the velocity 
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Figure 55. Collapse of a spherical star as in figure 54 (ui) showing a partial 
Cauchy surface #. It is the initial data on the compact region J-(7)n# 
of # which leads to the occurrence of the closed trapped surface Y in the 
compact region J-(7) NJ+(#). 


of the surface of the star would be about the velocity of light when the 
star was at its Schwarzschild radius. Now many stars have an angular 
momentum greater than the square of their mass (for the sun, ZL ~ m2). 
However it seems reasonable to expect some loss of angular momentum 
during the collapse because of braking by magnetic fields and because 
of gravitational radiation. The situation is therefore that in some stars, 
and probably most, angular momentum would not prevent occurrence 
of closed trapped surfaces, and hence a singularity. 

In a nearly spherical collapse a magnetic field B which is frozen into 
a star will increase as the matter density p to the 2 power. Thus the 
magnetic pressure is proportional to p*. This rate of increase is so slow 
that if the magnetic pressure is not important initially in supporting 
the star, then it will never be strong enough to have a significant effect 
on the collapse. 

To see why a burnt-out star of more than a certain mass cannot 
support itself against gravity, we shall give a qualitative discussion 
(based on unpublished work ‘by Carter) of the zero temperature 
equation of state for matter. 
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In hot matter there is pressure produced by the thermal motions 
of the atoms and by the radiation present. However in cold matter at 
densities lower than that of nuclear matter (~ 10!4gm em-%), the only 
significant pressure will arise from the quantum mechanical exclusion 
principle. To estimate this, consider a number density n of fermions 
of mass m. By the exclusion principle, each fermion will effectively 
occupy a volume of n—!. Thus by the uncertainty principle, it will have 
a spatial component of momentum of order fin}. If the fermions are 
non-relativistic, i.e. if in? is less than m, the velocity of the fermions 
will be of order int/m, while if the fermions are relativistic (i.e. iin? is 
greater than m) then the velocity will be practically one (the speed of 
light). The pressure will be of order (momentum) x (velocity) x (num- 
ber density), and so will be ~ #2n*m-1if int < m, and will be ~ find if 
int > m. When the matter is non-relativistic, the principal contribu- 
tion to the degeneracy pressure comes from the electrons, since m7} 
for them is bigger than it is for baryons. However at high densities, 
when the particles become relativistic, the pressure is independent of 
the mass of the particles producing it and depends simply on their 
number density. 

For small cold bodies, self-gravity can be neglected and the 
degeneracy pressure will be balanced by attractive electrostatic forces 
between nearest neighbour particles arranged in some sort of lattice. 
(We assume that there are equal numbers of positive and negative 
charges and approximately equal numbers of electrons and baryons.) 
These forces will produce a negative pressure of order e2n*. Thus the 
mass density of a small cold body will be of order 


Cmem,n* (~ lemem~), (9.1) 


where m, is the electron rest-mass and m, is the nucleon rest-mass. 

For larger bodies self-gravity will be important, and will compress 
the matter against the degeneracy pressure. To obtain an exact solu- 
tion would involve a detailed integration of Einstein’s equations. 
However the important qualitative features can be seen more easily 
from a simple Newtonian order of magnitude argument. In a star of 
mass M and radius 7, the gravitational force on a typical unit volume 
is of the order (M/r)2)nm,, where nm, ~ M/r,? is the mass density. 
The gravitational force will be balanced by a pressure gradient of 
order P/r), where P is the average pressure in the star. Thus 


P = M?/r,t ~ Ménim,}. 
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If the density is sufficiently low that the main contribution to the 
pressure is from the degeneracy of non-relativistic electrons, 


P =??nim, = Minin, J, 
so n= M*m,tmin-. 


This will be the correct formula for bodies for which it yields a value of 
n greater than (9.1) and less than m,°4-, i.e. for e8m,? < M < him, 
Such stars are known as white dwarfs. 

If the density is so high that the electrons are relativistic, es 
n > m,*h-%, then the pressure will be given by. the relativistic formula; 
so 2 — hd = = Mintm,3. Now n cancels out of this equation. Thus 
apparently one obtains a star of mass 


M,, = him, ~ 1.5 Mo, 


which can have any density greater than m,°m,f-%, i.e. any radius 
less than %#m,~!m,—!. Stars of mass greater than M4, simply cannot 
be supported by the degeneracy pressure of electrons. 

In fact, when the electrons become relativistic they tend to induce 
inverse beta decay with the protons, producing neutrons: 


Gs e100, sr lle 


This denudes the electrons and hence reduces their degeneracy 
pressure, thereby causing the star to contract and making the 
electrons more relativistic. This is an unstable situation, and the 
process will continue until nearly all the electrons and protons have 
been converted into neutrons. At this stage, equilibrium is again 
possible with the star supported by the degeneracy pressure of the 
neutrons. Such a body is called a neutron star. If the neutrons are 


non-relativistic, one finds 
n= Mh. 

If the neutrons are relativistic, the star must again have a mass M,, 
and a radius less than or equal to him,—®. However M,/him,-? = 
and so such a star is near the General Relativity limit M,/R ~ 2. 

The conclusion is that a cold star of mass greater than M,, cannot 
be supported by either electron or neutron degeneracy pressure. To 
show this rigorously, consider the Newtonian equation of support: 


dp/dr = —pM(r)r-, (9.2) 


where M(r) = an {"or dr 
0 
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is the mass within radius r. Multiply both sides of (9.2) by r* and 
integrate by parts from 0 to 79. This gives 


af "prs dr = (M(r)))?/87, 
0 


since p = 0 at r = ry. On the other hand, 


d 7 ? , a 3 : ? ? —t 
art |ernar) =A [aren] ar 
3 1 [rd aE 
= 5 (tort-3 [SB rtar’) pr < 22 pia, 


since dp/dr is never positive. As p is never greater than fin}, this shows 
that 


bar To 3 
i pridr <( nr dr) = f(M(r))¢ (4mm, )-4. 


0 0 


Therefore M(r,) must be less than (8%)? (47)-3 m,~, i.e. 
M(ry) < Shimy~. 


We summarize these results in figure 56. In this diagram we plot the 
average nucleon density n against the total mass M of the body. The 
solid line shows the approximate equilibrium configuration of a cold 
body. In a hot body there will be thermal and radiation pressure in 
addition to degeneracy pressure and so such bodies may be in equi- 
librium above the solidtine. The heavy dashed line on the right indi- 
cates where M/r, (which is Ménim,}) is equal to two. The region to the 
right of this line contains no equilibrium states, and corresponds to 
a star being within its Schwarzschild radius. Far away from this line 
to the left, the difference between Newtonian theory and General 
Relativity may be neglected. Near this line, one has to take into 
account General Relativistic effects. For a static spherically symmetric 
body composed of a perfect fluid, the Einstein field equations can be 
reduced to (see appendix B) 


A 


dp _ _ (+p) (M(r) + 4ar'p) (9.3) 
dr r(r—2M(r)) : 


where the radial coordinate is such that the area of the two-surface 
{r = constant, t = constant} is 47r?. M(r) is now defined as 


P 
i 4nr?u dr, 
0 
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Ficure 56. Nucleon number density n plotted against total mass of a static 
body M. The heavy line shows the equilibrium of cold bodies; hot bodies at 
suitable temperatures can be in equilibrium above this line. General Relativity 
forbids any bodies in the shaded region from being static. 


= 
where 4“ = p(1+€e) is the total energy density, p is nm,, and e¢ is the 
relativistic increase of mass associated with the momentum of the 
fermions. M(r,) is equal to the Schwarzschild mass MW of the exterior 
Schwarzschild solution for r > 79. For a bound star this will be less 
than the conserved mass 


_ { sae 
#- |, come 


nD? 


where N is the total number of nucleons in the star, because the differ- 
ence (Jf —M) represents the amount of energy radiated to infinity 
since the formation of the star from dispersed matter initially at rest. 
In practice this difference is never more than a few percent and in no 
case can it exceed 2M, since Bondi (1964) has shown that (1-2 |r) 
cannot be less than 4 provided u and pare positive and that # decreases 
outwards, and cannot be less than } if p is less than or equal to p. 
Therefore M < M < 3M. 

Comparing (9.3) with (9.2), with # in place of p and M in place of M, 
one sees that the extra terms on the right-hand side of (9.3) are all 
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negative provided € > 0 and p > 0. Thus since in Newtonian theory 
a cold star of mass M > M,, cannot support itself, neither can a cold 
star of Schwarzschild mass M > M, in General Relativity. This means 
that a cold star which contains more than 3I4,/m, nucleons cannot 
support itself. In practice, the extra terms in (9.3) mean that the 
limiting nucleon number is less than M,,/m,. 

In our discussion of neutron stars, we ignored the effects of nuclear 
forces. These will somewhat modify the position of the equilibrium line 
in figure 56 for such stars. For details, see Harrison, Thorne, Wakano 
and Wheeler (1965), Thorne (1966), Cameron (1970), and Tsuruta 
(1971). However they will not affect the important point that a star 
containing slightly more than J4,,/m, nucleons will not have any zero 
temperature equilibrium. This is because the point at which neutrons 
become relativistic in a star of mass M,, almost coincides with the 
General Relativity limit M/R ~ 2. Thus a star containing somewhat 
more than J/,/m, nucleons will not reach nuclear densities until it is 
already inside its Schwarzschild radius. 

The life history of a star will lie in a vertical line on figure 56, unless 
it manages to lose a significant amount of material by some process. 
The star will condense out of a cloud of gas. As it contracts, the 
temperature will rise due to the compression of the gas. If the mass is 
less than about 10-714, the temperature will never rise sufficiently 
high to start nuclear reactions and the star will eventually radiate 
away its heat and settle down to a state in which gravity is balanced 
by degeneracy presstire of non-relativistic electrons. If the mass is 
greater than about 10-7, the temperature will rise high enough to 
start the nuclear reaction which converts hydrogen to helium. The 
energy produced by this reaction will balance the energy lost by 
radiation and the star will spend a long period (~ 101°(M,,//)? years) 
in quasi-static equilibrium. When the hydrogen in the core is 
exhausted, the core will contract and the temperature will rise. 
Further nuclear reactions may now take place, converting helium in 
the core into heavier elements. However the energy available from this 
conversion is not very great, and so the core cannot remain in this 
phase very long. If the mass is less than M,, the star can settle down 
eventually to a white dwarf state in which it is supported by 
degeneracy pressure of non-relativistic electrons, or possibly to a 
neutron star state in which it is supported by neutron degeneracy 
pressure. However if the mass is more than slightly greater than M,,, 
there is no low temperature equilibrium state. Therefore the star must 
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either pass within its Schwarzschild radius, or manage to eject suffi- 
cient matter that its mass is reduced to less than MU, 

Ejection of matter has been observed in supernovae and planetary 
nebulae, but the theory is not yet very well understood. What calcula- 
tions there have been suggest that stars up to 20M, may possibly be 
able to throw off most of their mass and leave a white dwarf or neutron 
star of mass less than M,, (see Weymann (1963), Colgate and White 
(1966), Arnett (1966), Le Blanc and Wilson (1970), and Zel’dovich and 
Novikov (1971)). However it is not really credible that a star of more 
than 20M, could manage to lose more than 95 % of its matter, and so 
one would expect that the inner part of the star at any rate would 
collapse within its Schwarzschild radius. (Present calculations in fact 
indicate that stars of mass M > 5M, would not be able to eject 
sufficient mass to avoid a relativistic collapse.) 

Going to larger masses, consider a body of about 10° Jf. If this 
collapsed to its Schwarzschild radius, the density would only be of the 
order of 10-4 gm cm~3 (less than the density of air). If the matter were 
fairly cold initially, the temperature would not have risen sufficiently 
either to support the body or to ignite the nuclear fuel; thus there 
would be no possibility of mass loss, or uncertainty about the equation 
of state. This example also shows that the conditions when a body 
passes through its Schwarzschild radius need not be in any way 
extreme. ~ 

To summarize, it seems that certainly some, and probably most, 
bodies of mass > M;, will eventually collapse within their Schwarz- 
schild radius, and so give rise to a closed trapped surface. There are at 
least 10° stars more massive than Jf, in our galaxy. Thus there are 
a large number of situations in which theorem 2 predicts the existence 
of singularities. We discuss the observable consequences of stellar 
collapse in the next sections. 


9.2. Black holes 


What would a collapsing body look like to an observer O who remained 
at a large distance from it? One can answer this if the collapse is 
exactly spherically symmetric, since then the solution outside the 
body will be the Schwarzschild solution. In this case, an observer O’ 
on the surface of the star would pass within r = 2m at some time, say 
1 o’clock, as measured by his watch. He would not notice anything 
special at that time. However after he passes r = 2m he will not be 
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FicvRe 57. An observer O who never falls inside the collapsing fluid sphere never 
sees beyond a certain time (say, 1 o’clock) in the history of an observer O’ on 
the surface of the collapsing fluid sphere. 

(i) Finkelstein diagram; (ii) Penrose diagram. 


visible to the observer O who remains outside r = 2m (figure 57). 
However long the observer O waits, he will never see O’ at a time later 
than 1 o’clock as measured by O’’s watch. Instead he will see O”’s 
watch apparently slot down and asymptotically approach 1 o’clock. 
This means that the light he receives from O’ will have a greater and 
greater shift of frequency to the red and as a consequence a greater 
and greater decrease of intensity. Thus although the surface of the star 
never actually disappears from O’s sight, it soon becomes so faint as 
to be invisible in practice. In fact O would first see the centre of the 
disc of the star become faint, and then this faint region would spread 
outwards to the limb (Ames and Thorne (1968)). The time scale for 
this diminution of intensity is of the order for light to travel a 
distance 2m. 

One would be left with an object which, for all practical purposes, is 
invisible. However it would still have the same Schwarzschild mass, 
and would still produce the same gravitational field, as it did before 
it collapsed. One might be able to detect its presence by its gravita- 
tional effects, for instance its effects on the orbits of nearby objects, or 
by the deflection of light passing near it. It is also possible that gas 
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falling into such an object would set up a shock wave which might be 
a source of X-rays or radio waves. 

The most striking feature of spherically symmetric collapse is that 
the singularity occurs within the region r < 2m, from which no light 
can escape to infinity. Thus if one remained outside r = 2m one would 
never see the singularity predicted by theorem 2. Further the break- 
down of physical theory which occurs at the singularity cannot affect 
one’s ability to predict the future in the asymptotically flat region of 
space-time. 

One can ask whether this is the case if the collapse is not exactly 
spherically symmetric. In the previous section we used the Cauchy 
stability theorem to show that small departures from spherical sym- 
metry would not prevent the occurrence of closed trapped surfaces. 
However the Cauchy stability theorem in its present form says only 
that a sufficiently small perturbation in the initial data will produce 
a perturbation in the solution which is small on a compact region. One 
cannot argue from this that a perturbation of the solution will remain 
small at arbitrarily large times. 

We expect that in general the occurrence of singularities will lead 
to Cauchy horizons (as in the Reissner—Nordstr6ém and Kerr solutions) 
and hence to a breakdown of one’s ability to predict the future. 
However if the singularities are not visible from outside, one would 
still be able to predict in the exterior asymptotically flat region. 

To make this precise, we shall suppose that (4,8) has a region 
which is asymptotically flat in the sense of being weakly asympto- 
tically simple and empty (§6.9). There is then a space (4, %) into 
which (4, 8) is conformally imbedded as a manifold with boundary 

MW = M\) eM. where the boundary @ of WM in -@ consists of two 
null surfaces %+ and .¥- which represent future and past null infinity 
respectively. Let Y be a partial Cauchy surface in. #. We shall say that 
the space (.M, §) is (future) asymptotically predictable from F if 4+ is 
contained in the closure of D+(Y) in the conformal manifold ./. 
Examples of spaces which are future asymptotically predictable from 
some surface include Minkowski space, the Schwarzschild solution 
for m > 0, the Kerr solution for m > 0. |a| <m. and the Reissner— 
Niouaetittn solution for m > 0, |e| <m. The Kerr solution with 
|a| > m and the Reissner- Nonidet solution with |e] > m are not 
future asymptotically predictable, since for any partial Cauchy surface 
SF, there are past-inextendible non-spacelike curves from .%+ which do 
not intersect Y but approach a singularity. One can regard future 
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asymptotic predictability as the condition that there should be no 
singularities to the future of Y which are ‘naked’, i.e. which are 
visible from ¥+. 

In a spherical collapse, one gets a space which is future asymp- 
totically predictable. The question is whether this would still be the 
case for non-spherical collapse. We cannot answer this completely, 
Perturbation calculations by Doroshkevich, Zel’dovich and Novikov 
(1966) and Price (1971) seem to indicate that small perturbations from 
spherical symmetry do not give rise to naked singularities. In addition, 
Gibbons and Penrose (1972) have tried, and failed, to obtain contra- 
dictions which would show that in some situations the development 
of a future asymptotically predictable space was inconsistent. Their 
failure does not of course prove that asymptotic predictability will 
hold, but it does make it more plausible. If it does not hold, one cannot 
say anything definite about the evolution of any region of a space 
containing a collapsing star, as new information could come out of the 
singularity. We shall therefore proceed on the assumption that future 
asymptotic predictability holds at least for sufficiently small depar- 
tures from spherical symmetry. 

One would expect a particle on a closed trapped surface to be 
unable to escape to %*. However if one allowed arbitrary singularities 
one could always make suitable cuts and identifications to form an 
escape route for the particle. The following result shows that this is 
not possible in a future asymptotically predictable space. 

4 


Proposition 9.2.1 
ine 

(a) (#,8) is future asymptotically predictable from a partial 
Cauchy surface % 

(b) R,, K?K® > 0 for all null vectors K, 
then a closed trapped surface 7 in D‘(S) cannot intersect J-(4+,.7), 
i.e. cannot be seen from ¥*. 


For suppose 7 n J-(%+, “) is non-empty. Then there would be a 
point pe.%+ in J+(7,.@). Let W be the neighbourhood of -@ which is 
isometric to the neighbourhood @’ of ¢.@’ in the conformal manifold 
Md’ of an asymptotically simple and empty space (.4@’, 8’). Let ’ be 
a Cauchy surface in .@’, which coincides with “ on Yn 4’. Then 
S'—U' is compact and so by lemma 6.9.3, every generator of %+ 
leaves J+(S’ — YU’, W@’). This shows that if W is any compact set of Y, 


eS) 
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every generator of .4+ leaves J+(W,.@). From this it follows that 
every generator of .£+ would leave J+(7,.4), since this is contained 
in J+(J-(7)n F,.M). Therefore a null geodesic generator w of 
J+(7,.M) would intersect .4+. The generator ~ must have past end- 
point at 7, since otherwise it would intersect [~(.7). Since «4 meets 4+ 
it would have infinite affine length. However by the condition (6) every 
null geodesic orthogonal to Y would contain a point conjugate to 7 
within a finite affine length. Thus it could not remain in J+(7,.@) all 
the way out to .%+. This shows that 7 cannot intersect J-(.4+,.#).O 


From the above it follows that a closed trapped surface in D*(Y) in 
a future asymptotically predictable space must be contained in 
M —J~(%*, M). Therefore there must be a non-trivial (future) event 
horizon J-(%+,.M). This is the boundary of the region from which 
particles or photons can escape to infinity in the future direction. By 
§ 6.3 the event horizon is an achronal boundary which is generated by 
null geodesic segments which may have past endpoints but which can 
have no future endpoints. 


Lemma 9.2.2 


If conditions (a), (b) of proposition 9.2.1 are satisfied and if there is 
a non-empty event horizon J-(.7+, ./). then the expansion 6 of the 
null geodesic generators of J-(.%+, ./) is non-negative In 


| -(F*, MO) 1) DP). 


Suppose there was an open set Y such that 0 < 0in Yn J-(.4+, M). 
Let.7 beaspacelike two-surface in Yn J-(4+,.7). Then = y,%, < 0. 

Let ¥ be an open subset of Y which intersects 7 and has compact 
closure contained in Y. One can vary 7 by a small amount in W so 
that 72%, is still negative but such that in Y, 7 intersects J-(4+, H i 

As before, this leads to a contradiction since any generator of 
J+(7, MW) in J-(S+, M) would have past endpoint at 7 in W, where 
it w all be orthogonal to .7. However as ys", < 0inY, every out- 
going null geodesic orthogonal to F in ¥ w not contain a point 
conjugate to 7 within a finite affine distance, and so could not remain 
in J+(7,.M) all the way out to +t, Oo 


In a future asymptotically predictable space, J+(S) J-(I+, M) is 
contained in D*(S/). If there were a point p on the event horizon in 
J*+(S/) which was not in D+(/), the smallest perturbation could lead 
to p being in J~(.4*+, .#), i.e. being visible from infinity, which would 
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mean that the space was no longer asymptotically predictable. It 
therefore seems reasonable to slightly extend the definition of future 
asymptotically predictable, to say that space-time is strongly future 
asymptotically predictable from a partial Cauchy surface Y if 4+ is 
contained in the closure of D+(S) in .@, and J+(S)n J-(%+,.Z) is 
contained in Dt(). In other words, one can also predict a neighbour- 
hood of the event horizon from Y. 


Proposition 9.2.3 


If (4, §) is strongly future asymptotically predictable from a partial 
Cauchy surface , there is a homeomorphism 

a: (0, 00)xS>Dt(S)-Sf 
such that for each re(0,00), Y(r) = ({7}xF) is a partial Cauchy 
surface such that: 

(a) for 7, > 14, A(T.) S L(A (74)); 

(b) for each 7, the edge of Y(7) in the conformal manifold 4 is a 
spacelike two-sphere 2(r) in .%+ such that for 7, > 7,, 2(7,) is strictly 
to the future of 2(7,), 

(c) for each 7, (7) U{%~n J-({(r), W)} is a Cauchy surface in W@ 

foe CH’). 
In other words, (7) is a family of spacelike surfaces homeomorphic 
to S which cover D*(/)—F and intersect 4+ (see figure 58). One 
could regard them as surfaces of constant time in the asymptotically 
predictable region. We choose them to intersect “+ so that the mass 
measured on them at infinity will decrease when the emission of 
gravitational or other forms of radiation takes place. 

The construction for (7) is rather similar to that of proposition 
6.4.9. Choose a continuous family 2(7) («o > 7 > 0) of spacelike two- 
spheres which cover .4*, such that for 7, > 7,, 2(72) is strictly to the 
future of 2(7,). Put a volume measure on .@ such that the total 
volume of -@ in this measure is finite. We first prove: 


Lemma 9.2.4 
k(r), the volume of the set J-(2(r),.W@)n D*(S) is a continuous func- 
tion of 7. 
Let Y be any open set with compact closure contained in 

I-(Q(r), @)n DF). 
Then there are timelike curves from every point of VY to 2(r), which 
can be deformed to give timelike curves to 2(7—6) for some é > 0. 
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Cauchy 
surface for 
D*(Ff) 


Ficurs 58. A space (.%, §) which is strongly future asymptotically predictable 
from a partial Cauchy surface “, showing a family .¥(7) of spacelike surfaces 
which cover Dt(S) —F and intersect .%+ in a family of two-spheres J(7). 


~ 


Given any € > 0, one can find a ¥ whose volume is > /(7)—e. Thus 
there isa d > Osuch that k(7— 0) > k(7)—e. On the other hand, suppose 
there were an open set Y which did not intersect I~(2(r),.W@)n D*(F) 
but which was contained in J-(2(7'),.%)n D+(F) for any Tar 
Then if pe¥, there would be past-directed timelike curves A, from 
each J(7’) to p. As the region of 4+ between 2(r) and 2(7,) is compact 
for any T, > 7, there would be a past-directed non-spacelike curve A 
from 2(r) which was the limit curve of the {A_}. Since the {A,} did not 
intersect I~(2(r),.@), A would not either, and so it would be a null 
geodesic and would lie in I-(Q(r),.@). It would enter .W and so it 
would either have a past endpoint at p, or would intersect Y. The 
former is impossible as it would imply that ¥ intersected I-(2(7),.@), 
and the latter is impossible as pelt(S). This shows that there is no 
open set which is in J~(2(7’),.@) for every 7’ > 7, but which is not in 
I-(Q(r), HM) D*(F). Thus given e, there is a 5 such that 


k(t +0) < k(r) +e. 
Therefore k(T) is continuous. Oo 
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Proof of proposition 9.2.3. Define functions f(p) and h(p,t), pe Dt(F), 
which are volumes of I+(p) and I-(p) — I-(Q(r),.@). Asin proposition 
6.4.9, the function f(p) is continuous on the globally hyperbolic region 
D*(/)—F, and goes to zero on every future-inextendible non- 
spacelike curve. Since J-(2(r),.@ )nN & is a past set, 


D+(S)—I-(Q(r), M)-SF 


is globally hyperbolic. Thus for each 7, h(p,7) is continuous on 
D*(S)—SF. This means that given any e > 0, one can find a neighbour- 
hood % of p such that |h(q,7)—A(p,7)| < de for any ge%. By lemma 
9.2.4, one can find a 6 > 0 such that |k(7r’) —k(z)| < he for |7’—7] < 6. 
Then |h(q.7') —~h(p,7)| < €, which shows that h(p,7) is continuous on 
(D*(S) —F) x (0,00). The surfaces /(7) can then be defined as the set 
of points pe D*(/)—YF such that h(p,7) = 7f(p). Clearly these are 
spacelike surfaces which cover D+(.Y)—Y and satisfy properties 
(a)-(c). 

To define the homeomorphism «@, one needs a timelike vector field 
on D+(S)—¥ which intersects each surface Y(7). We construct such 
a vector field as follows. Let YW be a neighbourhood of 4+ in the 
conformal manifold .#, let X, be a non-spacelike vector field on ¥ 
which is tangent to the generators of %+ on 4+, and let xz, > 0 bea C? 
function which vanishes outside Y and is non-zero on 4+. Let X, be 
a timelike vector field on 4, and let x, > 0 be a CO? function on .# 
which is non-zero on -@ and is zero on %*. Then the vector field 
X = 2,X,+2,X, has the required property. The homeomorphism 
a: D*(S)—-S > (0,0) x F then maps a point pe Dt(S) —F to (7,¢q) 
where 7 is such that pe.Y(T), and the integral curve of X through p 
intersects F at q. O 


If there is an event horizon J~(.%+, .@) in the region D+(S) of a future 
asymptotically predictable space, then it follows from property (6) of 
proposition 9.2.3 that for sufficiently large 7, the surfaces (7) will 
intersect it. We define a black hole on the surface (7) to be a connected 
component of the set B(r) = Y(t) —J-(F*,.@). In other words, it is 
a region of Y(7) from which particles or photons cannot escape to 4*. 
As increases, black holes can merge together, and new black holes 
can form as the result of further bodies collapsing. However, the 
following result shows that black holes can never bifurcate. 
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Proposition 9.2.5 

Let B,(7,) be a black hole on Y(7,). Let #,(7,) and B,(T,) be black 
holes on a later surface (7,). If #,(7,) and A(T.) both intersect 
J+(B,(1,)), then By(72) = Ba(T2). 


By property (c) of proposition 9.2.3, every future-directed inexten- 
dible timelike curve from &,(7,) will intersect /(7,). Thus 


J*(By(71)) N S (72) 


is connected, and will be contained in a connected component of 
BT). fl 


For physical applications, one is interested primarily in black holes 
which form as the result of gravitational collapse from an initially 
non-singular state. To make this notion precise, we shall say that the 
partial Cauchy surface Y has an asymptotically simple past if J-(f) 
is isometric to the region J-(Y’) of some asymptotically simple and 
empty space-time (.4’, 8’), where “’ is a Cauchy surface for (4, 8’). 
By proposition 6.9.4, the surface ”’ has the topology R* and so ¥ also 
has this topology. Proposition 9.2.3 therefore shows that if (-@, §) is 
strongly future asymptotically predictable from a surface “ with an 
asymptotically simple past, then each surface (7) has the topology 
R38, and the union of “(7) with the boundary two-sphere.?(7) on 4* is 
homeomorphic to the unit cube J?. 

Although one is primarily interested in spaces which have asymp- 
totically simple pasts it will in the next section be convenient to con- 
sider future asymptotically predictable spaces which do not have this 
property, but which at large times may closely approximate to spaces 
which do. An example of this is the spherically symmetric collapse we 
considered at the beginning of this section. Once the surface of the star 
has passed inside the event horizon, the metric of the exterior region 
is that of the Schwarzschild solution, and is unaffected by the fate of 
the star. When studying the asymptotic behaviour it is therefore 
convenient simply to forget about the star, and consider the empty 
Schwarzschild solution as a space which is strongly future asymp- 
totically predictable from a surface Y such as that shown in figure 24 
on p. 154. This surface does not have an asymptotically simple past, 
and its topology is S? x R! instead of R?. However the portion of Y 
outside the event horizon in region I hasthe same topology as the region 
outside the event horizon of the surface Y(r) in figure 57. We want to 
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consider spaces which are strongly future asymptotically predictable 
from a surface Y, and are such that the portion of F outside the event 
horizon has the same topology as some surface (7) in a space with an 
asymptotically simple past. Of course in more complicated cases there 
may be several components of &(r), corresponding to the collapse of 
several bodies. We shall therefore consider spaces which are strongly 
future asymptotically predictable from a surface Y, and with the 
property: 

Giey 1 I~ AM) is homeomorphic to R?—(an open set with 
compact closure). 
(Note that this open set may not be connected.) It will also be con- 
venient to demand the property: 

(8) F# is simply connected. 


Proposition 9.2.6 
Let (4, 8) be a space which is strongly future asymptotically predict- 
able from a partial Cauchy surface Y which satisfies (a), (f). Then: 
(1) the surfaces (7) also satisfy (a), (6); 
(2) for each 7, 0F,(r), the boundary in (7) of a black hole Z,(7), is 
compact and connected. 


Since the surfaces (7) are homeomorphic to Y, they satisfy property 
(f). One can define an injective map 
YS (7)N J-(I+, M)>PS 1 I-( I+, M) 

by mapping each point of Y(7) down the integral curves of the vector 
field of X proposition 9.2.3. Since (4, §) is weakly asymptotically 
simple, one can find a two-sphere F near “+ in Y(r)n J-(F+,.Z). 
The portion of /(7) outside FY will map into the region of # outside 
the two-sphere y(#). This shows that the region of Yn J-(.4+,.Z) 
which is not in y(.A(7)N J-(£+,M)) must have compact closure. 
Therefore y(Y(r) 0) J-(.4+, Z)) will be homeomorphic to R3- (an open 
set with compact closure). Since (7) is homeomorphic to k?-W 
where ¥ is an open subset of R® with compact closure, 04(r) will be 
homeomorphic to é¥ and so will be compact. 04,(T) being a closed 
subset of d&(r) will be compact. 

Suppose that ¢#,(7) consisted of two disconnected components 
6B,\(r) and 0F,?(r). One could find curves A, and A, in /(T)— Ar) 
from 2(r) to 6B,\(r) and 0&,?(r) respectively. One could also find a 
curve 4 in int Z,(7) from 0#,1(r) to 0#,?(r). Joining these together one 
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would obtain a closed curve in (7) which crossed 0%,1(7) only once. 
This cannot be deformed to zero in Y(7), contradicting the fact that 
S(r) is simply connected. O 


We are only interested in black holes that one can actually fall into, 
i.e. ones in which the boundary @@(7) is contained in J*+(%-, ZH). We 
shall therefore add to properties (a), (f) the requirement: 

(y) for sufficiently large 7, S(7)n J-(4%+,H) is contained in 
J+(9-, M). 

We shall say that (4, g) is a regular predictable space if it is strongly 
future asymptotically os from a partial Cauchy surface Y 
and if properties («), (f), (y) are satisfied. All the spaces mentioned at 
the beginning of seh sesiem as being future asymptotically predict- 
able are in fact also regular predictable spaces. Proposition 9.2.6 
shows that when one is dealing with regular predictable spaces de- 
veloping from a partial Cauchy surface , there is a one-one corres- 
pondence between black holes #;(7) and their boundaries 6% ,(7) in 
S(T). One could therefore in such a situation give an equivalent defini- 
tion of a black hole as a connected component of A(r)n J-(4*, -Z). 

The next result gives a property of the boundaries of black holes 
which will be important in the next section. 


Proposition 9.2.7 


Let (4, 8) be a regular predictable space developing from a partial 
Cauchy surface , in which R,,A¢K° > 0 for every null vector K¢. 
Let 4(r) be a black hole on the surface (7). and let {%,(7')} 
(2 = 1 to V) be the black holes on an earlier surface Y(r’) which are 
such that J+(A;(7’))n By(7) + @. Then the area A,(r) of @A,(r) is 
greater than or equal to the sum of the areas A,(r’) of @B,(r’); the 
equality can hold only if N = 1. 


In other words, the area of the boundary of a black hole cannot 
decrease with time, and if two or more black holes merge to form a 
single black hole, the area of its boundary will be greater than the areas 
of the boundaries of the original black holes. 

Since the event horizon is the boundary of the past of 4+, its null 
geodesic generators would have future endpoints only if they inter- 
sected %*. However this is impossible, as the null geodesic generators 
of %+ have no future endpoints. Thus the null generators of the event 
horizon have no future endpoints. By lemma 9.2.2, their expansion 0 
is non-negative. Thus the area of a two-dimensional cross-section of 
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the generators cannot decrease with 7. By property (c) of proposition 
9.2.3, and by proposition 9.2.5, all the null geodesic generators of 
J-(%*, M) which intersect F(r (7’) in any of the 2#;(r’) must intersect 
S(T) in #,(r). Thus the area of 0%,(7) is greater than or equal to the 
sum of the areas of the {#;(r')}. When N > 1, 0%,(r) will contain 
N disjoint closed subsets which correspond to the generators of 
J~(4+, M@) which intersect each dF, ,(7'). Since 6#,(7) is connected, it 
must contain an open set of generators which do not intersect any 
0% ,(7’), but have past endpoints between Y(7) and. F(7’). 0 


It has been convenient to define black holes in terms of the event 
horizon J~(.4*,./), because this is a null hypersurface with a number 
of nice properties. However this definition depends on the whole future 
behaviour of the solution; given the partial Cauchy surface (7), one 
cannot find where the event horizon is without solving the Cauchy 
problem for the whole future development of the surface. It is there- 
fore useful to define a different sort of horizon which depends only on 
the properties of space-time on the surface Y(7). 

One knows from proposition 9.2.1 that any closed trapped surface 
on (rT) in a regular predictable space developing from a partial 
Cauchy surface Y must be in &(r). This result depends only on the fact 
that the outgoing null geodesics orthogonal to the two-surface are 
converging. It does not matter whether the ingoing null geodesics are 
converging or not. We shall therefore say that an orientable compact 
spacelike two-surface in D+ (Ff) is an outer trapped surface if the 
expansion @ of the outgoing null geodesics orthogonal to it is non- 
positive. (We include the case 9 = 0 for convenience.) In order to 
define which is the outgoing family of null geodesics we make use of 
property (/) of the partial Cauchy surfaces (7). Let X be the timelike 
vector field of proposition 9.2.3. Then any compact orientable space- 
like two-surface F in D+(S) can be mapped by the integral curves of X 
into a compact orientable two-surface F’ in Y(7), for any given value 
ofr. Let A bea curve in Y(7) U 2(r) from 2(r) to #’ which intersects F” 
only at its endpoint. Then one can define the outgoing direction on 
Ff in SF(r) as the direction for which A approaches #’. As S(T) is 
simply connected, this definition is unique. The outgoing family of 
null geodesics orthogonal to F is then that family which is mapped 
by X onto curves in “(r) which are outgoing for 7’. 

Knowing the solution on the surface (7), one can find all the outer 
trapped surfaces F which lie in (7). We shall define the trapped 
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region J (rT) in the surface Y(r) as the set of all points geEF(T) such 
that there is an outer trapped surface F lying in Y(r), through q. As 
is shown by the following result, the existence of the trapped region 
T(r) implies the existence of a black hole A(7), and in fact 7 (7) lies 
in &(r) for each value of 7. 


Proposition 9.2.8 

Let (4, &) be a regular predictable space developing from a partial 
Cauchy surface £, in which R,,K*K® > 0 for any null vector K*. 
Then an outer trapped surface F in Dt+(/) does not intersect 


JAP), 


The proof is similar to that of proposition 9.2.1. Suppose F intersects 
J-(4+, M@). Then J+(P,.@) would intersect .%+. To each point of 
I+ J+(P, M) there would be a past-directed null geodesic generator 
of J+(P,.@) which had past endpoint on Y, and which contained no 
point conjugate to Y. By (4.35) the expansion @ of these generators 
would be non-positive, as it is non-positive at P and as R,, K*K” > 0. 
Thus the area of a two-dimensional cross-section of the generators 
would always be less than or equal to the area of Y. This establishes 
a contradiction, as the area of +n J+(P,./) is infinite, as it is at 
infinity. O 
~ 

We shall call the outer boundary @.7,(7T) of a connected component 
TJ, (rT) of the trapped region .7 (r),an apparent horizon. By the previous 
result, the existence of an apparent horizon @.7,(7) implies the existence 
of a component 04,(r) of the event horizon outside it, or coinciding 
with it. However the converse is not necessarily true: there may not 
be outer trapped surfaces within an event horizon. 

On the other hand, there may be more than one connected com- 
ponent of 7(r) within one component é4,(7) of the event horizon. 
These possibilities are illustrated in figure 59. A similar situation arises 
when one considers the collision and merger of two black holes. On an 
initial surface Y(7,), one would have two separate trapped regions 
F,(7,) and .7F,(7,) contained in black holes (7), and B,(7,) respec- 
tively. As they approached each other, the two components 0&,(T) and 
0%,(T) of the event horizon would amalgamate to form a single black 
hole 4,(7,) on a later surface Y(1,). The apparent horizons 0.7;(7) and 
07,(7) would however not join up immediately. Instead what would 
happen is that a third trapped region 7,(7) would develop surrounding 
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Ficure 59. The spherica} collapse of a star of mass m, followed by the 
spherical collapse of avshell of matter of mass dm; the exterior solution will be 
a Schwarzschild solution of mass m after the collapse of the star, and a 
Schwarzschild solution of mass m+ 46m after the collapse of the shell. At time 


7, there is an event horizon but no apparent event horizon; at time 7, there are 
two apparent horizons within the event horizon. 


them both (figure 60). At some later time, 7,, 7, and 7; might merge 
together. 

We shall only outline the proofs of the principal properties of the 
apparent horizon. First of all one has: 


Proposition 9.2.9 

Each component of 0.7 (7) is a two-surface such that the outgoing 
orthogonal null geodesics have zero convergence 6 on @7 (7). (We shall 
call such a surface, a marginally outer trapped surface.) 

If 6 were positive in a neighbourhood in 0.7 (7) of a point ped (7), 
then there would be a neighbourhood Y of p such that any outer 


322 GRAVITATIONAL COLLAPSE [9.2 


Wi 
rs 
ee 


Ce 
PTT WG IN 
Md | 
\ 


oe 


S79) 


New apparent 
horizon 


Event horizon 


‘Black hole’ 


Apparent horizon 


Ficure 60. The collision and merging of two black holes. At time 7,, there are 
apparent horizons @7,, 0F, inside the event horizons 64,, @#, respectively. 
By time 7,, the event horizons have merged to form a single event horizon; 
a third apparent horizon has now formed surrounding both the previous 
apparent horizons. 


trapped surface in /(7) which intersected @ would also intersect 
a7 (7). Thus 6 < 0 on 07 (7). 

If 8 were negative in a neighbourhood in ¢@7 (7) of a point p € éF (7), 
one could deform ¢.7 (7) outwards in (7) to obtain an outer trapped 
surface outside 07 (7). Oo 


The null geodesics orthogonal to the apparent horizon ¢.7(r) on a 
surface /(7) will therefore start out with zero convergence. However 
if they encounter any matter or any Weyl tensor satisfying the 
generality condition ($4.4), they will start converging, and so their 
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intersection with a later surface Y(7’) will lie inside the apparent 
horizon @7 (7’). In other words, the apparent horizon moves outwards 
at least as fast as light; and moves out faster than light if any matter 
or radiation falls through it. As the example above shows, the apparent 
horizon can also jump outwards discontinuously. This makes it 
harder to work with than the event horizon, which always moves in 
a continuous manner. We shall show in the next section that the event 
and apparent horizons coincide when the solution is stationary. One 
would therefore expect them to be very close together if the solution 
is nearly stationary for a long time. In particular, one would expect 
their areas to be almost the same under such circumstances. If one has 
a solution which passes from an initial nearly stationary state through 
some non-stationary period to a final nearly stationary state, one can 
employ proposition 9.2.7 to relate the areas of the initial and final 
horizons. 


9.3. The final state of black holes 


In the last section, we assumed that one could predict the future far 
away froma collapsing star. We showed that this implied that the star 
passed inside an event horizon which hid the singularities from an 
outside observer. Matter and energy which crossed the event horizon 
would be lost for ever from the outside world. One would therefore 
expect that there would be a limited amount of energy available to 
be radiated to infinity in the form of gravitational waves. Once most 
of this energy had been emitted, one would expect the solution outside 
the horizon to approach a stationary state. In this section we shall 
therefore study black hole solutions which are exactly stationary, in 
the expectation that the exterior regions will closely represent the 
final states of solutions outside collapsed objects. 

More precisely, we shall consider spaces (.4, 8) which satisfy the 
following conditions: 

(1) (4,8) is a regular predictable space developing from a partial 
Cauchy surface S. 

(2) There exists an isometry group 0,: 4 —-# whose Killing vector 
K is timelike near %* and 7-. 

(3) (4, 8) is empty or contains fields like the electromagnetic field 
or scalar field which obey well-behaved hyperbolic equations, and 
satisfy the dominant energy condition: 7,,,N¢L’ > 0 for future- 
directed timelike vectors N, L. 
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We shall call a space satisfying these conditions, a stationary regular 
predictable space. We expect that for large values of 7, the region 
J-(I+, M)n J+(f(7)) of a regular predictable space containing 
collapsing stars will be almost isometric to a similar region of a 
stationary regular predictable space. 

The justification for condition (3) is that one would expect any 
non-zero rest-mass matter eventually to fall through the horizon. Only 
long range fields like the electromagnetic field would be left. Conditions 
(2) and (3) imply that (4, 8) is analytic in the region near infinity 
where the Killing vector field K is timelike (Miller zum Hagen (1970)). 
We shall take the solution elsewhere to be the analytic continuation 
of this outer region. The stationary solutions we are considering here 
will not have asymptotically simple pasts, as they represent only the 
final state of the system and not the earlier dynamical stage. However 
we shall be concerned only with the future properties of these solutions, 
and not their past properties. These might not be the same, as there 
is no a priori reason why they should be time reversible, though in fact 
it will be a consequence of the results we shall prove that they are time 
reversible. i 

In a stationary regular predictable space, the area of a two-section 
of the horizon will be time independent. This gives the following 
fundamental result: 


Proposition 9.3.1 

Let (4, 8) be a stationary, regular predictable space-time. Then the 
generators of the future event horizon J-(.%*+,.#) have no past end- 
points in J+(.%-,./). Let Y,* be the future-directed tangent vectors 
to these generators; then in J+(.%-,./#), Y,* has zero shear G and 
expansion 6, and satisfies 


Ray Y,4 Y, i =0= Vy Catera Y, fl Y,? ee 


In order not to break up the discussion we shall defer the proof of this 
and other results to the end of this section. This proposition shows that 
in a stationary space-time, the apparent horizon coincides with the 
event horizon. 

We shall now present some results which indicate that the Kerr 
family of solutions (§5.6) are probably the only empty stationary 
regular predictable space-times. We shall not give the proofs of the 
theorems of Israel and Carter here, but shall refer to the literature. 
The other results will be proved at the end of this section, Because of 
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these results, we expect that the solution outside an uncharged 
collapsed object will settle down to a Kerr solution. If the collapsed 
body had a net electric charge, we would expect the solution to 
approach one of the charged Kerr solutions. 


Proposition 9.3.2 


Each connected component in J+(.%~,.@) of the horizon é&(7) in a 
stationary regular predictable space is homeomorphic to a two-sphere. 


It is possible that there could be several connected components of 
¢.4(r) representing several black holes at constant distances from each 
other. This situation can occur in the limiting case where the black 
holes have charge e equal to their mass m, and are non-rotating (Hartle 
and Hawking (1972a)). It seems probable that this is the only case in 
which one can get a sufficiently strong repulsive force to balance the 
gravitational attraction between the black holes. We shall therefore 
consider solutions where ¢A(r) has only one connected component. 


Proposition 9.3.3 

Let (.4, 8) be a stationary regular predictable space. Then the Killing 
vector A@ is non-zero in J +(.4-, @)n J-(.4+, M), which is simply con- 
nected. Let 7, be such that ¥(T))N J~($+,M) is contained in 
J~(%-,.W). If @B(r 9) has only one connected component, then 
pe = Wi Sa, M)N AM is homeomorphic to [0, 1) x S?x Ri. 


The discussion now takes one of two possible courses, depending on 
whether or not the Killing vector K*has zero curl, K,.,K,9@%, every- 
where. If the curl is zero, the solution is said to be a static regular 
predictable space-time. Roughly speaking, one would expect the 
solution to be static if the black hole is not rotating in some sense. 


Proposition 9.3.4 

In a static regular predictable space-time, the Killing vector K is 
timelike in the exterior region J*+(.%-, @)n J-(4*, @) and is non-zero 
and directed along the null generators of J~(%*, @) on 

J-(I+, M)n Jt+(F-,M). 

Since the curl of K vanishes, it is hypersurface orthogonal, i.e. there is 
a function & such that K, is proportional to £.,. One can then decom- 
pose the metric in the exterior region in the form g,, = fUK, Ky tha 
where f= K¢K, and h,, is the induced metric in the surfaces 
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{€ = constant} and represents the separation of the integral curves 
of K*. The exterior region therefore admits an isometry which sends 
a point on a surface & to the point on the surface —& on the same 
integral curve of K. This isometry reverses the direction of time, and 
a space admitting such an isometry will be said to be time symmetric. 
Thus if the analytic extension of the exterior region contains a future 
event horizon J-(.£+,.@), it will also contain a past event horizon 
J+(%-, M). These event horizons may or may not intersect; the 
Schwarzschild solution and the Reissner-Nordstrom solution with 
e? < m? are examples where they do intersect, and the Reissner— 
Nordstrém solution with e? = m* is an example where they do not. 
The gradient of f is zero on the horizon in the latter case, but not in the 
former cases. The significance of this comes from the fact that on the 
future horizon J-(.4+, W@)n J+(4-, M), Kay K° = 4f,, = BK,, where 
£ > 0 is constant along the null geodesic generators of J-(.%+,.@). 
Let v be a future-directed affine parameter along such a generator. 
Then K = @ 6/év where a is a function along the generator which obeys 
da/dv = £. If 8 + 0 and the generator is geodesically complete in the 
past direction, a and the Killing vector K will be zero at some point. 
This point cannot lie in J+(.%-,./#), and so will be a point of inter- 
section of the future event horizon J~(.%+,./) and the past event 
horizon J+(.%-, #) (Boyer (1969)). If # = 0.K will always be non-zero 
and there will be no such point where the horizon bifureates. 


Israel (1967) has shown that a static regular predictable space-time 
must be a Schwarzschild solution if: 

(a) bes > 0; 

(b) the magnitude f = A*K, of the Killing vector has non-zero 
gradient everywhere in J+(.%-, #)n J-(.£+, M); 

(c) the past event horizon J+(.%-,./) intersects the future event 
horizon J~(.%+,.@) in a compact two-surface F. 


(It follows from (c) and proposition 9.3.2 that F is connected and has 
the topology of a two-sphere. Israel did not give the conditions in this 
precise form, but these are equivalent.) Israel (1968) has further shown 
that the solution must be a Reissner—Nordstrém solution if the empty 
space condition (a) is replaced by the requirement that the energy— 
momentum tensor is that of an electromagnetic field. Miller zum 
Hagen, Robinson and Seifert (1973) have removed condition (6) in 
the vacuum case. 

From these results we expect that if the final state of the solution 
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outside the event horizon is static, then the metric in the exterior 
region will be that of a Schwarzschild solution. 


We shall now consider the case where the final state of the exterior 
solution is stationary but not static. We would expect this to be the 
case when the object that collapsed was rotating initially. 


Proposition 9.3.5 


In an empty stationary regular predictable space which is not static, 
the Killing vector A“ is spacelike in part of the exterior region 
Pt(9-, AOI +, M). 


The region of TAT ae I (F*, M4) on which K® is spacelike, is 
called the ergosphere. From proposition 9.3.4 it follows that there is no 
ergosphere if the solution is static. The significance of the ergosphere 
is that in it, it is impossible for a particle to move on an integral curve 
of the Killing vector A%, i.e. to remain at rest as viewed from infinity. 
Since the ergosphere is outside the horizon it is still possible for such 
a particle to escape to infinity. An example of a stationary non-static 
regular predictable space with an ergosphere is the Kerr solution for 
a? < m* (§ 5.6). 

Penrose (1969), Penrose and Floyd (1971) have pointed out that one 
can extract a certain amount of energy from a black hole with an 
ergosphere, by throwing a particle from infinity into the ergosphere. 
Since the particle moveson a geodesic, Hy = —p,.*.K, > 0 is constant 
along its trajectory ; 


(Dot Kg). 5 Po? = (Py pho ya + Po" Kao Po° = 0, 

as po’ is a geodesic vector and A“ is a Killing vector), where po” = mv 9% 
is the momentum vector of the particle, m is its rest-mass and Vo is the 
unit tangent to the particle world-line. The particle is then supposed 
to split into two particles with momentum vectors p,* and p,", where 
Pot = Pit + po". Since K* is spacelike, it is possible to choose p,* to be 
a future pointing timelike vector such that Z, = —p,*K, < 0. Then 
E, =—p,“K, will be greater than Ey. This means that the second 
particle can escape to infinity where it will have more energy than the 
original particle that was thrown in. One has thus extracted a certain 
amount of energy from the black hole. 

The particle with negative energy cannot escape to infinity, but 
must remain in the region where K@ is spacelike. Suppose that the 
ergosphere did not intersect the event horizon J-(4+,M). Then the 
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particle would have to remain in the exterior region. By repeating the 
process, one could continue to extract energy from the solution. As 
one did this, one would expect the solution to change gradually. How- 
ever the ergosphere cannot shrink to zero, as there has to be somewhere 
for these negative energy particles to exist. It therefore appears that 
either one could extract an infinite amount of energy (which seems 
improbable), or that the ergosphere would eventually have to intersect 
the horizon. We shall show that in the latter case the solution would 
spontaneously become either axisymmetric or static without any 
further extraction of energy by the Penrose process. Either the possi- 
bility of the extraction of an infinite amount of energy or the occur- 
rence of a spontaneous change would seem to indicate that the original 
state of the black hole was unstable. It therefore seems reasonable to 
assume that in any realistic black hole situation the ergosphere 
intersects the horizon. 

Hajicek (1973) has shown that the stationary limit surface, which is 
the outer boundary of the ergosphere, will contain at least two 
integral null geodesic curves of A?. If the gradient of f is non-zero on 
these curves, and if they are geodesically complete in the past, they 
will contain points where A“ is zero. However there can be no such 
points in the exterior region (see proposition 9.3.3), so the ergosphere 
must intersect the horizon in this case. However although it might be 
reasonable to assume that the integral curves of A“ weracomplete in 
the future, it does not seem reasonable to assume that they are com- 
plete in the past, since that would be to assume something about the 
past region of the solution which, as we said before, is not physically 
significant. In the static case one could show that the solution was 
time symmetric, but there is no a priori reason why a stationary non- 
static solution should be time symmetric. For this reason we shall rely 
on the energy extraction argument above rather than on Hajicek’s 
results, to justify our assumption that the ergosphere intersects the 
horizon. 

One can explain the significance of the ergosphere touching the 
horizon as follows. Let 2, be one connected component of 


J-(I+, M) 1 J+($-,M) 


and let Y, be the quotient of 2, by its generators. By propositions 
9.3.1 and 9.3.2, this will be homeomorphic to a two-sphere. By proposi- 
tion 9.3.1, the spatial separation of two neighbouring generators is 
constant along the generators, and so can be represented by an induced 
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metric h on Y,. The isometry 6, moves generators into generators, and 
so acts as an isometry group of (Y,, h). If the ergosphere intersects the 
horizon, A“ will be spacelike somewhere on the horizon and the action 
of 4, on (Y,,h) is non-trivial. Therefore it must correspond to a rota- 
tion of the sphere Y, around an axis, and the orbits of the group in Y, 
will be two points, corresponding to the poles, and a family of circles. 
A particle moving along one of the generators of the horizon would 
therefore appear to be moving relative to the frame defined byeKe 
which is stationary at infinity. One could therefore say that the 
horizon was rotating with respect to infinity. 

The next result shows that a rotating black hole must be 
axisymmetric. 


Proposition 9.3.6 


Let (4,8) be a stationary non-static regular predictable space, in 
which the ergosphere intersects J~(%*,./)n J+(%-, M@). Then there 
is a one-parameter cyclic isometry group O. (0<¢@ < 27) of (.4,§) 
which commutes with 6,, and whose orbits are spacelike near 4+ 
and #-. 


The method of proof of proposition 9.3.6 is to use the analyticity of 
the metric § to show that there is an isometry O5 in a neighbourhood 
of the horizon. One then extends the isometry by analytic continua- 
tion. The method would therefore work even if the metric were not 
analytic in isolated regions away from the horizon, for example if there 
were a ring of matter or a frame of rods around the black hole. This 
leads to an apparent paradox. Consider a rotating star surrounded by 
a stationary square frame of rods. Suppose that the star collapsed to 
form a rotating black hole. If the black hole approached a stationary 
state, it would follow from proposition 9.3.6 that the metric § was 
axisymmetric except where it was non-analytic at the rods. However 
the gravitational effect of the rods would prevent the metric being 
axisymmetric. The resolution of the paradox seems to be that the 
black hole would not be in a stationary state while it was rotating. 
What would happen is that the gravitational effect of the rods would 
distort the black hole slightly. The back reaction on the frame would 
cause it to start rotating and so to radiate angular momentum. 
Eventually the rotation of both the black hole and the frame would be 
damped out and the solution would approach a static state. A static 
black hole need not be axisymmetric if the space outside it is not 
empty, i.e. if condition (a) of Israel’s theorem is not satisfied. 
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The above discussion indicates that a realistic black hole will never 
be exactly stationary while it is rotating, as the universe will not be 
exactly axisymmetric about it. However in most circumstances, the 
rate of slowing down of the rotation of the black hole is extremely slow 
(Press (1972), Hartle and Hawking (19726)). Thus it is a good approxi- 
mation to neglect the small asymmetries produced by matter at a 
distance from the black hole, and to regard the rotating black hole as 
being in a stationary state. We shall therefore now consider the 
properties of a rotating axisymmetric black hole. 

The following result of Papapetrou (1966), generalized by Carter 
(1969), shows that the Killing vectors K% corresponding to the time 
translation 6, and K“ corresponding to the angular rotation b are 
both orthogonal to families of two-surfaces. 


Proposition 9.3.7 
Let (.4, 8) be a space-time which admits a two-parameter abelian 
isometry group with Killing vectors §, and &,. Let Y be a connected 
open set of 4, and let way = F410 S201 If. 

(a) Ww, Ro. — ony. 

(b) w., = 0 at some point of Y, 
then Wiap:cWde = OonV. 


Condition (6) is satisfied in a stationary axisymmetric space-time on 
the axis of axisymmetry, i.e. the set of points where K¢ = 0. Condition 
(a) is satisfied in empty space, and when the energy-momentum tensor 
is that of a source-free electromagnetic field (Carter (1969)). By 
Frobenius’ theorem (Schouten (1954)), the vanishing of way. . Ware 18, 
when w,, + 0, the condition that there should exist locally a family of 
two-surfaces which are orthogonal to w,,, i.e. to any linear combina- 
tion of , and &,. In the case of a stationary axisymmetric space-time, 
this means that one can locally introduce coordinates (t, ¢, 21, x?) such 
that K = 0/ét, K = 0/@¢, and A¢2”,, = 0 = Raging, forun:=«1,2eDhie 
metric then locally admits the isometry (t, 6, 21,2?) >(—t, —¢, x}, 2°), 
which reverses the direction of time, i.e. it is time-symmetric. Thus if 
the analytic extension of metric near infinity of an empty stationary 
regular predictable space-time contains a future event horizon, it will 
also contain a past event horizon. 
In analogy with proposition 9.3.4, one has 
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Proposition 9.3.8 (cf. Carter (1971b)) 


Let (-4, 8) be a stationary axisymmetric regular predictable space- 
time in which w,,5.,Wz, = 0, where w,, = K,, Ky). Then at any point 
in the exterior region J+(%-,./@)n J-(%+, M) off the axis K = 0, 
h = w,,w® is negative. On the horizons J-(.%+, M@) J+(%-, M) and 
J+(S-, M)n J-(F+, M), h is zero but Way + 0 except on the axis. 


This shows that at each point off the axis in the exterior region, there 
is some linear combination of the Killing vectors K¢ and K¢ which is 
timelike. Outside the ergosphere, A“ itself is timelike, but between the 
stationary limit surface and the horizon one has to add a multiple of 
K“ to obtain a timelike Killing vector. On the horizon there is no linear 
combination which is timelike, but there is a linear combination which 
is null, and is directed along the null generators of the horizon. Off the 
axis K = 0, one can locally characterize the horizon as the set of points 
on which h = w,,w™ = 0. 

We now come to the theorem of Carter (19716) which indicates that 
the Kerr solutions are probably the only empty stationary black holes. 
He considered stationary regular predictable spaces which satisfy: 

(a) Ta, = 0, 

(6) they are axisymmetric, 

(c) the past event horizon J+(.%-,.@) intersects the future event 
horizon J-(.%+,./) in a compact connected two-surface F, 


(By proposition 9.3.2, this will be a two-sphere.) He showed that such 
solutions fall into disjoint families, each of which depends only on two 
parameters. The two parameters can be taken to be the mass m and 
angular momentum JL as measured from infinity. One such family is 
known, namely the Kerr solutions for m > 0, a? < m?, where a = L/m. 
(The Kerr solutions with a? > m? contain naked singularities and so 
are not regular predictable spaces.) It seems unlikely that there are 
any other disjoint families. It has been conjectured, therefore, that the 
solution outside an uncharged-collapsed object will settle down to a 
Kerr solution with a? < m?. This conjecture is supported by analyses 
of linear perturbations from aspherical collapse by Regge and Wheeler 
(1957), Doroshkevich, Zel’dovich and Novikov (1966), Vishveshwara 
(1970), and Price (1972). 

Assuming the validity of this Carter-Israel conjecture, one would 
expect the area of the two-surface 0M(r) in the event horizon to 
approach the area of a two-surface in the event horizon r = 7, of a 
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Kerr solution with the same mass and angular momentum, as 
measured at 2(r) on 4+. This area is 87m(m + (m?— a?)t), where m is 
the mass of the Kerr solution and ma is the angular momentum. (If 
the collapsing body has a net electrical charge e one would expect the 
solution to settle down to a charged Kerr solution. The area of a two- 
surface in the event horizon of such a solution is 


4m (2m? — e? + 2m(m? — a? — e?)4). 


Using this expression one can generalize our results to charged black 
holes.) Consider a collapse situation which by a surface /(7,) has 
settled down to a Kerr solution with mass m, and angular momentum 
m,4,. Suppose one now lets the black hole interact with particles or 
radiation for a finite time. The solution will eventually settle down, by 
a surface (7,), to a different Kerr solution with parameters mg, ap. 
From the discussion of § 9.2, the area of é4(7,) must be greater than 
or equal to the area of 0#(r,). In fact it must be strictly greater than, 
since 6 can be zero only if no matter or radiation crosses the horizon. 
This then implies that 


M2(M+ (M2 — ay”)2) > my (mM, + (m2 — a,”)3). (9.4) 


If a, + 0, then the inequality (9.4) allows m, to be less than m,. Since 
there is a conservation law for total energy and momentum in an 
asymptotically flat space-time (Penrose (1963)), this would mean that 
one had extracted a certain amount of energy from the black hole. 
One way of doing this would be to construct a square frame of rods 
about the black hole and employ the torque exerted by the rotating 
black hole on the frame to do work. Alternatively, one could use 
Penrose’s process of throwing a particle into the ergosphere, where it 
divides into two particles, one of which escapes to infinity with greater 
energy than the original particle. The other particle will fall through 
the event horizon and reduce the angular momentum of the solution. 
One can thus regard the process as extracting rotational energy from 
the black hole. Christodoulou (1970) has shown that one can achieve 
a result arbitrarily near the limit set by the inequality (9.4). In fact the 
maximum energy extraction occurs when a, = 0; then the available 
energy (m,—,) is less than 


{ a,2\3\% 
m|t—e(1+ (tas) ) 


Consider now a situation in which two stars a long way apart collapse 
to produce black holes. There is thus some 7’ such that @4(7') consists 
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of two separate two-spheres 0F,(r7’) and 0B,(7'). Since these are a long 
way apart, one can neglect their interaction and assume that the solu- 
tions near each are close to Kerr solutions with parameters mMy,, a, and 
Mg, 42 respectively. Thus the areas of 0,(7’) and a&,(r') will be 
approximately 87m,(m,+(m,2—a,2)3) and 877m,(Mz + (M2 — Ay?)) 
respectively. Now suppose that these black holes fall towards each 
other, collide and coalesce. In such a collision a certain amount of 
gravitational radiation will be emitted. The system will eventually 
settle down by a surface (7) to resemble a single Kerr solution with 
parameters m,, a,. By the same argument as previously, the area of 
04(7") must be greater than the total area of @#(7'), which is the sum 
of the areas 0#,(r’) and 0&,(7’). Thus 


m,;(Mmsz + (m3"— ag”)®) > m,(m, + (mM? — a,")2) + m.(m, + (mo? — Q5")8). 


By the conservation law for asymptotically flat spaces, the amount of 
energy carried away to infinity by gravitational radiation is 


M+ Ms,— Mz. 
This is limited by the above inequality. The efficiency 
€ = (m,+m,—mMs;) (Mm, +m) 


of conversion of mass to gravitational radiation is always less than 4. 
If a, = a, = 0, then e < 1—1/,/2. It should be stressed that these are 
upper limits; the actual efficiency might be much less, although the 
mere existence of a limit might suggest that one could attain an 
appreciable fraction of it. 

We have shown that the fraction of mass which can be converted 
to gravitational radiation in the coalescence of one pair of black holes 
is limited. However if there were initially a large number of black 
holes, these could combine in pairs and then the resulting holes could 
combine, and so on. On dimensional grounds one would expect the 
efficiency to be the same at each stage. Thus one would eventually 
convert a very large fraction of the original mass to gravitational 
radiation. (This argument was suggested by C. W. Misner and M.J. 
Rees.) At each stage, the energy emitted in gravitational radiation 
would be larger. This might be able to explain Weber’s recent observa- 
tions of short bursts of gravitational radiation. 

We now give the proofs of the propositions we have stated in this 
section. For convenience, we repeat the statements of the propositions. 
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Proposition 9.3.1 
Let (.4, 8) be a stationary, regular predictable space-time. Then the 
generators of the future event horizon J-(%+,M) have no past end- 
points in J+(%-, .@). Let Y,2 be the future-directed tangent vector to 
these generators; then in J+(-, @), Y,* has zero shear G and expan- 
sion 6, and satisfies 

ie Gud hi =0= Ve Capaa yp N- 
Let @ be a spacelike two-sphere on .%—. Then one can cover .%~ by a 
family of two-spheres @(t) obtained by moving @ up and down the 
generators of %- under the action of 6,, i.e. @(t) = O(¢). We now 
define the function x at the point peJ+(%~,.%) to be the greatest 
value of ¢ such that peJ+(¢(t), W). Let Y be a neighbourhood of .4+ 
and .¥— which is isometric to a corresponding neighbourhood of an 
asymptotically simple space-time. Then x will be continuous and have 
some lower bound x’ on. /n Y. From this it follows that x will be con- 
tinuous in the region of J-(.4+,.@) where it is greater than x’. Let 
ped*(F-,M)n J-(¥+, MW). Then under the isometry 6,, p will be 
moved into the region of J~(4+, W), where x > x’. However 

X] op) me |p +1. 

Therefore x will be continuous at p. 

Let t)>0 be such that S(7))n J-(%7,.M) is centained in 
J+(%-, M). Let A be a generator of J-(.4+,./) which intersects oS 7,): 
Suppose there were some finite upper bound x, to x on A. Since the 
space is weakly asymptotically simple, x 0o as one approaches 2(7,) 
on (74). Thus there will be some lower bound x, of x on 

S (74) 0 J—( ae, MD). 
Under the action of the group 6,, A is moved into another generator 
6,(A). As the generators of J-(.%+,.%) have no future endpoints, the 
past extension of @,(A) will still intersect Y(7,) N JAF +, M). This leads 
to a contradiction, since the upper bound of x on 0,(A) would be less 
than 2, iff < x,—2p. 

Let x, be the upper bound of x on (7) 9 J-(.4+,.H). Then every 
generator A of J-(.4+,.4@) which intersects £ (7>)) will intersect 

Fit) =J* C(t), M) 0 IS, MM) for t> x. Every generator of 
3 (£+, MH) which intersects F(t’) will ee O(S (79) fort > t' ~a,. 
But 6(A(7))n J-( 4+, M) = 0 F(T) n J-(.%+,.@)) is compact. Thus 
F (t) is compact. 
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Now consider how the area of F (t) varies as t increases. Since 6 > 0 
the area cannot decrease. If 9 were > Oonan open set, the area would 
increase. Also if the generators of the horizon had past endpoints on 
F (t) the area would increase. However as F(t) is moving under the 
isometry 0,, the area must remain the same. Therefore 6 = 0, and 
there are no past endpoints on the region of J-(.%+,.W@) for which 
« > x». However since each point of J-(.%+,.Z@)n J+(.4-, @) can be 
moved by the isometry 6, to where x > x, this result applies to the 
whole of J-(.%+,.@)n J*(%~, @). From the propagation equations 
(4.35) and (4.36) one then finds G,,,=0, R,,Y,*Y,2»=0 and 
ViteCaweta ap Xy2Nio = 0, where Y,% is the future-directed tangent 
vector to the null geodesic generators of the horizon. 0 


Proposition 9.3.2 


Each connected component in J+(.4-,./) of the horizon @#(r) in a 
stationary, regular predictable space is homeomorphic to a two-sphere. 


Consider how the expansion of the outgoing null geodesics orthogonal 
to 0&@(r) behaves if one deforms @@(r) slightly outwards into 
J-(£+, M). Let Y,* be the other future-directed null vector orthogonal 
to é&(r), normalized so that Y,*¥,, = —1. This leaves the freedom 
Y,-Y,' = e”Y,, Y,> Y,’ = e-” Y,. The induced metric on the space- 
like two-surface 0A(r) is ha, = Gan +YiaYon +YoaYip. Define a family of 
surfaces F (7, w) by moving each point of 6A(T) a parameter distance w 
along the null geodesic curve with tangent vector Y,’. The vectors Y,* 
will be orthogonal to A (r, w) if they propagate according to 


tee en, ee and «Y,*¥,,=—1. 
Then 
(Y2, p24), Yeh cht, = heap, yh? +p° py 
— ht Y,%, hY,,, ph, + Rs, Vo Yychh",, (9.5) 
where p% = — hey, » 21°. Contracting with it, one obtains 


dé x 
ap = (ee (LW 


= Py, qh?4— Rye ¥,7¥ oo + Radon Yi? Yo" YoY? + Pap" 
cae Vie GE h,. 


On the horizon, Y,*.,A°¢h®, is zero, as the shear and divergence of the 
horizon are zero. Under a rescaling transformation Y,’ = e’Y,, 
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Y,’=e-"Y,, the vector p? changes to p’* = pr+ hey .p, and so 
dO/dw|,,.9 changes to 


av = Py; alt +y, sqh?4— RY Ye 
W lw=0 
+ Boao Yi7¥2°Fa2%; + p'%p'a. (9.6) 


The term ¥, 5g h°@ is the Laplacian of y in the two-surface 0@(r). By 
a theorem of Hodge (1952), one can choose y so that the sum of the 
first four terms on the right of (9.6) is a constant on C#(r). The sign of 
this constant will be determined by that of the integral of 


(— BaeVy? Yeo + Roan Yi Ve" %27%,’) 


over 0B(T) (pp. ,f°4, being a divergence, has zero integral). This 
integral can be evaluated using the Gauss—Codacci equations for the 
scalar curvature R of the two-surface with metric h: 


Rix Rag hh) = RR VPP esd 
since 0 = & = 0 on @A(r). By the Gauss—Bonnet theorem (Kobayashi 


and Nomizu (1969)) ite: ‘ 
| KdS = 27x, 
dB(r) 


where d8§ is the surface area element of cB(r) and y is the Euler 
number of 0A (r). Thus 


~ 


[seen (Fee 28 + Resa BAVC *Y,)) AS 


= = m+ | “2 (thi Yt ¥.) a8. (9.7) 
dB(r) 
By the Einstein equations, 
gh+ Ra ie Y,? a 87T ve Y,?, 


which is > 0 by the dominant energy condition. The Euler number ¥ 
is +2 for the sphere, zero for the torus, and negative for any other 
compact orientable two-surface (¢B(r) has to be orientable as it is 
a boundary). Hence the right-hand side of (9.7) can be negative only 
if 0B(r) is a sphere. 

Suppose that the right-hand side of (9.7) was positive. Then one 
could choose y so that d&’/dw’|,,. was positive everywhere on 04% (7). 
For small negative values of w’ one would obtain a two-surface in 
J~(4*,.d@) such that the outgoing null geodesics orthogonal to the 
surface were converging. This would contradict proposition 9.2.8. 


9.3] THE FINAL STATE 337 


Suppose now that y was zero and that T,,,Y,“Y,° was zero on @4(7). 
Then one could choose y so that the sum of the first four terms on the 
right of (9.6) was zero on 0@(r). Then 


pe, h?,+ RareaY;*¥P¥°¥.4 = 0 

on 0&(r). If Roy.g¥4° Yo ¥,°¥,% was non-zero somewhere on 0&(r), then 
the term p’*p’, in (9.6) would be non-zero somewhere and one could 
change y slightly so as to make d6’|dw’|,,_) positive everywhere. This 
would again lead to a contradiction. 

Now suppose that R,,,.4¥,°¥," ¥,°Y,% and p’* were zero everywhere 
on 04(r). One could move the two-surface 0&(r) back along Y,*, 
choosing the rescaling parameter y at each stage so that 


Ss db he, te Regal Se ry, 
<i 3h— 2h Vi°Y2" a p'?,,h>,—4k = 0. 

If T,, ¥,°¥,” or p’? were non-zero for w’ < 0 then one could adjust y to 
obtain a two-surface in J-(.%*,.W@) with 8 < 0. This would contradict 
proposition 9.2.8. On the other hand if 7,,¥,7Y,’ and p’* were zero 
everywhere for w’ < 0, one would obtain a two-surface in J-(.%+,.@) 
with 8 = 0 which again contradicts proposition 9.2.8. 

One avoids a contradiction only if y = 2, ie. if 0#(T) is a two- 
sphere. QO 


Proposition 9.3.3 

Let (-@, 8) be a stationary regular predictable space-time. Then the 
Killing vector K¢ is non-zero in Jt(4-, M) 1 J-(F+, M), which is 
simply connected. Let 7, be such that S(T») 9 J~(4+, M) is contained 
in J*(%-,.@). If 0B(7,) has only one connected component, then 
J+($-, M) 1 J-( $+, MH) M is homeomorphic to (0, 1) x S?x Ri. 


The function x defined in proposition 9.3.1 is continuous on 
J+($-, M) J-(£+*, M), and has the property that 2\|9.,) = 2|, +. 
This shows that K cannot be zero in J+(.%-,.W@)n J-(£+, W@). The 
integral curves of K establish a homeomorphism between two of the 
surfaces a 
J+(E(t), H)n I-( 9+, M)1 M (-—wo<t<o). 

The region J+(%-, “)n J-(£+,. UM) M is covered by these sur- 

faces, and so is homeomorphic to R!x J+(@(t’), @)n J-(F*, MH) M 


for any t’. Choose t to be large enough that J+(€(t), @) intersects 
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S (ro) in the neighbourhood Y of 4+ which is isometric to a similar 
neighbourhood in an asymptotically simple space. The integral curves 
of K establish a homeomorphism between 


I+(6(t), M)n I-(F+, M)) M and L(t)n I-(I*, M). 


By property («) and proposition 9.3.2, this is simply connected. If 
further 0(r) has only one connected component, then 


S(T) J-(F*, A) 


has the topology [0, 1) x S?. Thus J*(4-, M) 1 J-(£+,M) 1 W has the 
topology [0, 1) x S? x Rl. oO 


Proposition 9.3.4 

In a static regular predictable space-time, the Killing vector K is 
timelike in the exterior region J+(.%-,.W) 0 J~(4*, @) and is non-zero 
and directed along the null generators of J>(F* aon 

J-( I+, M) 1 J+(.F-,M). 

The event horizon J~(.%+,.#)is mapped into itself by the isometry @,. 
Thus on J-(.¥+,.W)n J+(F~-, W@), K must be null or spacelike. Let 75 
be such that S(7,)N J-(%+,.@) is contained in J+(.%-,./). Then 


f = K¢K, must be zero on some closed set. in f 


TS (7) IF *, Ml), 
From the fact that A® is a Killing vector and curl K = 0, it follows that 


ye, = Keon: (9.8) 

By proposition 9.3.3, AK is non-zero on the simply connected set 

J+(I-, WM) J-(%+, M). By Frobenius’ theorem, it follows from the 

condition curl K = 0, that there is a function € on this region such that 
K, = —«&,,, where « is some positive function. 

Let p be a point of VW and let A(v) be a curve through p lying in the 
surface of constant € through p. Then by (9.8), 


d D 
1Ka == a 
4K qp oat = 5p ; 
If A(v) left, the left-hand side of this equation would be unbounded. 


However the right-hand side is continuous; therefore A(v) must lie 
in. V’,so.A” must contain the surface = £|,,. However f cannot be zero 
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on an open neighbourhood of p, since it would then be zero every- 
where. Thus the connected component of . through p is the three- 
surface € = £|,. Suppose peJt(%-,.W)n J-(4+,.W). Then there 
would be a future-directed timelike curve y(w) from .%~ through p 
to #t. On € = €|,, K* would be future-directed. Thus (G/eu),§ > 0 
when é = |,,. This leads to a contradiction as £ = |, cannot intersect 
JF* or F— since K® is timelike near infinity. Thus near .%+ and .f-, 
either € is greater than £|,, or less than §].,,. a) 


Proposition 9.3.5 
In an empty regular predictable space-time which is not static, the 
Killing vector A? is spacelike in part of the exterior region 
Jt+(I-,M) 1 J-( I+, M). 
The function x introduced in proposition 9.3.1 is continuous on 
I+( I>, MN Ja, A), and is such that along each integral curve 
of K*, @#/ob=1. One can approximate the surface x= 0 in 
J+(F-, M)n J-(£+, M) by a smooth surface # which is nowhere 
tangent to K?. One can then define a smooth function Z on 
J+($-, M@) J-(£+, M) by specifying that Z = 0on # and TD Geo le 
One can express the gradient of the Killing vector as 
Rat = Navea Kw? + Kiaf 1; 
where f = K¢K, is the magnitude of the Killing vector, and 
“ot = ¥y"4K, K,, a: 
The second derivatives of K satisfy 
2K a: hel = Bagpe 
However Ky. = Kja:sje. Therefore 
K a:be — Raa ee 
which implies ei? = — he Ke. (9.9) 
The vector g, = f1K,—%., is orthogonal to K*. Multiplying (9.9) by 
g, and integrating over the region & of J-(%*,.@) bounded by the 
surfaces V, and W, defined by % = x,+1 and x = x, +2, where 2, is 
as in proposition 9.3.1, one finds 


[_,BaoKegrae = — | (Kesbgg).y dv { Keg 
ie LG Lf 


--| Keitg, doy—2{ fot, dv. (9.10) 
OL Le 
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The boundary 0 of F consists of the surfaces dL, = Win J(4t, M), 
OL, = Ny J-(F+, M), the portion dF, of J-(4+, MH) between 1, and 
AN;,, and the portion 0%, of — between 4 and .4,. The surface integral 
over 0f, is minus that over 6%, since these surfaces are carried into 
each other by the isometry 9,. 

Near %-, f= —1+(2m/r)+O(r-*) and w%w, = O(r-*), where r is 
some suitable radial coordinate. Thus the surface integral over 0%, at 
J~- vanishes. Suppose now that K¢ were timelike everywhere in &, 
becoming null on the horizon. Then w?, being orthogonal to K, would 
be spacelike everywhere in &. Therefore if w is non-zero, i.e. the 
solution is non-static, the last term on the right of (9.10) will be 
negative. This leads to a contradiction if the space is empty and if the 
integral over 0f, is zero. 

To evaluate this integral, one has to apply a limiting procedure. 
Let z be a function on the surface 4, which is zero on the horizon but 
such that the gradient of z in/, is not zero on the horizon. The function 
z can be defined on¥ by the condition z,, K* = 0. One can express the 


gradient of z as Z.q = E,,239(Ka+fR,); 


where R¢ is a vector field tangent to the surfaces {7 = constant} and 
normalized so that RK, = —1. One now takes i K2+>q, do, over the 
surface {z = constant} between 47 and .4;. Then do, = doz.,, where 


do is some continuous measure. Thus = 


[x= % do, => [GFF R44 AS RYE 92 odo. 


Since the horizon was the surface f = 0 and since K@ was directed 
along the null generators of the horizon, f a 18 proportional to A¢ on 
the horizon. Therefore 
Keg do, = 0. 
aL, 
This gives a contradiction which shows that A* must be spacelike 
somewhere in ¥ if the space is empty. Oo 


Proposition 9.3.6 


Let (., 8) be a stationary non-static regular predictable space-time 
in which the ergosphere intersects J-(%+,.@ )n Jt+(%-, M). Then 
there is a one-parameter cyclic isometry group 6 (Oe POr 
(M,8) which commutes with 6,, and whose orbits are spacelike near 
I+ and F-. , 
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Let 2, be one connected component of J-(4+,.W/)n J+(%-,. 7), and 
let Y, be the quotient of 2, by its generators. Then the orbits of the 
isometry @;, in the horizon 2, will be spirals which repeatedly intersect 
the same generators. Let t; > 0 be such that ,, is one rotation of 4). 
Then if pe 2,. @,(p) will lie on the same generator of 2,. It will lie to 
the future of p, since 
@lo.() = Xp +b. 

One can now choose the future-directed null vector Y, to be directed 
along the generators, and scaled so that 


De — 26Y,,, ewhere e747 0, 
(ii) if visa parameter along the generators such that Y, = 0/¢v, then 


Plato = lp +h. 


The vector field Y, defined in this way is invariant under the isometry 
@,.1.e. Lg ¥, = 0. One can now define a spacelike vector field Y, in 2, 
by Y,; = K—Y,: then Lg Y, = 0 and Ly Y3 = 0 (note that Y; is not 
@ unit vector. and in fact it will vanish on the generators y, and y, 
corresponding to the poles of Y,). The integral curves of Y, in 2, will 
be circles which degenerate to points on y, and yp. 

Let je bea curve in 2, from y, toy, orthogonal to Y, and Y,, and 
such that the orbits of Y, which intersect ~ form a smooth spacelike 
two-surface A in 2,. Let A(v) be the family of spacelike two-surfaces 
in 2, obtained by moving each point of FY a parameter distance v up 
the generators of 2,. A(y) is also equal to 0,(P). Let Y, be the other 
null vector orthogonal to -A(v), normalized so that ¥,7Y,, = — 1 (see 
figure 61); then Lg Y, = 0. 

Let Y, be a spacelike vector on uw, tangent to x. Then one can define 
Y, on 2, by dragging it along by K and Yj, i.e. Lg Y, = 0 = Ly, Yq. 
(These are compatible because Ly Y, = 0.) Y, will be orthogonal to Y, 
on 2, because L[g(Y¥,°9,,¥,°) = 0, and 

(Ya Via) oN? = Nyt oY Na tN yYaah’. 
The first term is zero because Y, is null and the second term equals 
2eY,, ¥,*. Thus Y,,¥,%, being zero initially, remains zero. Y, will be 
orthogonal to Y, on 2, because it lies in the surface A(v), and Y, is 
normal to the surface. It will also be orthogonal to Y; on 2, because 
Tng(¥o?Jqn¥i°) = 0, and 
(ae Ve a DE Cee = Sue An ee =0 

since Yyq.ph2h?4 =) 


I2 : HLS 
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Ficure 61. The isometry @,, moves the point p and the surface A(v) into the 
point 6,(») and the surface A(v+¢,) in the horizon 2,. Y, is tangent to a null 
geodesic generator of 2,. Y, is a null vector orthogonal to A(vw and Y, lies in 
Y(v). Kis the Killing vector field on 2, which generates the isometry group 4. 


In a neighbourhood of 2,, there will be a unique null geodesic A 
orthogonal to a surface A(v) through a given point r. One can then 
define coordinates (v,w.0,¢) for the point r, where w is the affine 
distance (as measured by Y,) along «, and (v,@.¢) have their values 
at nn 2,, where @ and © are spherical polar coordinates for the 
generators of 2, such thatel;@ , = 0. 1j%@, = Ominjother words, 
we choose Y, = (27/t,)c/¢¢ and Y, = ¢/@@ on Y,.) We shall take the 
basis {Y,, Y.. Y3, Y4} to be parallelly propagated along the null geo- 
desies with tangent. vector Y,. Then Y, = é/éw. We define the vector 
K to be e/év. This means that the Lie derivative of K by Y, is zero. 
We define the vector Z¢ to be 


1 { Ve a J Y,¢ 
a es a a | 
V2 (a (Y,°Y4,)2 
Then Lea, = 0, ZZ, sel, ZAZA 0, 


where ~ denotes the complex conjugate. 
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One can define on 2, a family {g,,} of tensor fields, where 


=e and g, = Ly, (Ly,(...(Ly,8)...)). 
SE ee ee 
nm terms 


In the coordinates given above, g,, 4, = 0"(Jqy)/Cw”. Since the solution 
is analytic, it is completely determined by the family g, on 2,. We 
shall show that on 2,, the Lie derivatives with respect to K of all the 
g,, vanish. Then the Lie derivative of the g,, with respect to K = K—K 
will also vanish. This shows that the solution will admit a one- 
parameter group 6, generated by K. For simplicity we shall consider 
only the empty space case, but similar arguments hold in the presence 
of matter fields, like the electromagnetic or scalar fields, which obey 
well-behaved hyperbolic equations. 

By our choice of coordinates, the components of Lg § are the partial 
derivatives with respect to v of the coordinate components g,,. These 
are all constant on 2,,s0 Lg §|,, = 0. We shall show below Lg é,| 2, = 9, 
and then proceed by a method of induction. Suppose that 


Lggn|o,=9, n>. 


It then follows from the construction of the basis that Lg of the nth 
covariant derivatives of all the basis vectors Y,,Y,, Z, Z are zero. 
or Gnttab = In Bete a5 In cb Jee a 2 In He oe ie 

The Lie derivative with respect to K of the second and third terms 
on the right are zero. Nhe first term involves covariant derivatives of 
Y, of order (n+ 1) and lower orders. The Lie derivative with respect to 
K of all the lower order terms are zero. The terms involving (n + 1) 
covariant derivatives are 


(Yssgpape. nen 75 ¥, ti —— ys. : Ge 
= (Voq.neto’ + Yop: ac¥e’). ¢...ghe¥o'---Yo° + lower order terms 
= (G2. ee a Led Oe $5 (Yo5- Gs Ee a R, Pe eu Ey Bras 
x Y,/...Y,°+ lower order terms. 


The Lie derivatives with respect to K of this expression will be zero, 
if the Lie derivative with respect to K of the Riemann tensor and its 
covariant derivatives to order (n—1) vanish. Then Lg$,,,,|9, will be 
Zero: 

To show that the Lie derivatives with respect to K of g, and of the 
covariant derivatives of the Riemann tensor are zero, it is convenient 
to use some notation introduced by Newman and Penrose (1962). 


12-2 
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This involves using a pseudo-orthonormal basis with the two spacelike 
vectors Y, and Y, combined to give a single complex null vector Z, 
giving each component of the connection and the curvature tensor 
a separate symbol, and writing out all the Bianchi identities and the 
defining equations for the curvature tensor explicitly without samma- 
tion. These relations are combined in pairs to form half the number 
of complex equations. The symbols for the connection components are: 


ie Vere Aa se =e aM 
P= Tay pane A= lean ZZ, 
ce 2 ie AEE, [= eRe Zaz”, 
T= Vas oZ%,’, v= —¥yq,, 2%, 


€= 8 anlel—2ii52), @= ihe ee 4, 
B= WY a; 0%e°Z’-Zq; 29D), y= Nias oYa7 Vso -Z,, p ZY’). 
The symbols for the Wey] tensor are: 
Yo = =Car Ze. 
V1 = — Capea? YP VS? 
Ve = — Cavea(Yi* Yo Vy’ Yo4 — ¥,°¥,2 2°24), 
We Cy ei cae 
Ye = =e 


We are considering empty space, so the Ricci tensor is zero (i.e. 
® 4, = 0 = A in the Newman-Penrose formalism). Since the basis is 
parallelly propagated along Y,, » = y =7 = 0. As Y, is the gradient 
of the coordinate *% m= P+ and p= #. Furthermore on 2,, 
K=p—o =0,€=6, V(€) = Ganda — 

The equations we shall need are: 


¥,(x) —Z(e) = (o+€—2€)a+ BG~Pe—KA+(e+p)z, (9.11a) 
Y,(B)—Z(e) = (a+7m)o+ (p—€) B—pe— (—m)e+'V,, (9.115) 
¥,(A)—Z(m) = pA+ Gu +22 + (a—B)n—(3e-2)A, (9.11¢) 
Vi(4)—Z (7) = pu+or+na—(e+8)n— m(a—-B)+¥,, (9.11d) 
Z(p)—Z(o) = p(&+f)—o(3a—f)-¥, (9.11) 
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(these are obtained from the Newman-—Penrose equations (4.2)), and: 


mt, —Z0e = — 3KF,+ (264+ 4p) ¥,-—(-—7+4a)WVy, (9.124) 
VCE.) —Z(L,) = — 2e'F + 30V,— (— 27 + 2a)¥,—AW,, (9.126) 
¥,(¥s)—Z(¥,) = —K¥, — (26— 2p) ¥5+37F.— 2AY a (9.12c) 
Y,(¥,)-—Z(¥) = — (4e—p) V4 (4a + 2a) ¥— 3A'P,, (9.12d) 
Y,(¥) —Z(¥}) = —#¥ —26¥ +30, (9.12 e) 


(these are obtained from the Newman—Penrose equations (4.5)). 

From (9.11¢). ¥'; = 0on 2,. Then from (9.126), ¥\(W,) = K(¥,) = 0 
on 2,. Adding (9.11@) to the complex conjugate of (9.11), one 
obtains 


Y,(7) = ¥,(a+ B) = Z(e)+ Zé) + 2mp + 2G — n(e—€) —KA—KA+¥. 
On 2,, this becomes Y¥,(7) = Ze) + Zé). 


Therefore ¥,(¥,(7)) = ¥,(Z(e)+ Z(é)) on 2,. But on Q,, Ly, Z = 0 and 
¥,(e) = 0. Thus ¥,(2{(7)) = 0 on 2,. This shows that 7 = A+ Bvon 2,, 
where dA and B are constant along a generator of 2,. However 
7 |, = Te,q; therefore 7 is a constant along the generators of 2,. 
Subtracting the complex conjugate of (9.116) from (9.11a@), one finds 
that (« — #) is constant along the generators. 

One now applies similar arguments to (9.11¢) and (9.11d) to show 
that and A are constant along the generators of 2,. Sincez, wand A 
determine the covarianf derivative of Y,. it follows that Lg Y,*., =0 
on 2, and hence that Legg, = 0 on J,. 

One can also apply the above kind of argument to (9.12c) and 
(9.12d) to show that Y(‘"5) = Y,(¥,) = 0 on 2,. Thus Lg Royog = 0 on 
2, and so the Lie derivative with respect to K of the second derivatives 
of the basis vectors are zero. In particular Y, Y, acting on any of the 
components of the connection gives zero. 

From (9.12e), K(¥,(",)) = ¥,¥(") = 0 on 2,. One now operates 
with Y, Y, on (9.12a). The commutator Y, Y,—Y, Y, involves only 
the first covariant derivatives of the basis vectors. Thus 


ee y,.—Y,Y,)i20 one?,. 
From this it follows by an argument like that given above that 
K(Y,(¥,)) = H(%(¥s)) = 0 on 2. 


One now repeats the argument for (9.106), (9.10c) and (9.10d) to show 
that K(Y,(¥,)) = K(¥("3)) = K(¥(",)) = 0 on 2,. This shows that 
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the Lie derivatives with respect to K of the first covariant derivatives 
of the Riemann tensor vanish. One then repeats the process, showing 
that K(Y,(¥,(,))) = 0 on 2,, and so on. Oo 


Proposition 9.3.7 
Let (.%,&) be a space-time which admits a two-parameter abelian 
isometry group with Killing vectors §, and &,. Let VY be a connected 
open set of A, and let wy, = fyraSon1- Tf 
(a) Wop Fe, 9° 1 Wes = 0on v, 
(b) w,, = 0 at some point of Y, 
then War; ede = 0 on Y. 
Let wx = Gia: SaPe nea and on = boa. b Ved 1 ea Then 
grbea ny = — Al £ [ese ec 67 
= BIE ECE PI SIEGE lag biel 2 8) Eee Ee 
Therefore 
(31) 4x: a = eit. ponte Eee ese qe” 
dees eps ea Sees Gute Scare | 
= fae ler ; oF a 26a d Eve °° : 
— 2G ME o? gba 4 — 26 E97, 7g. (9.13) 
The first and fourth terms vanish because &, and &, are Killing vectors; 
the second and fifth terms cancel each other because &, and §, com- 
mute. Because &, is a Killing vector, LZ; €),., = 0. This implies that 
the third term vanishes. Similarly Lg,51, ,, = 0 because &, is a Killing 
vector which commutes with &,. This implies that the sixth and eighth 
terms cancel. The seventh term vanishes because £,7, ,£,°'4 is sym- 
metric; and because of the relation £,.,. = Racpa$? satisfied by any 
Killing vector, £4: 4, = — R%,£°. Equation (9.13) is therefore 
By condition (a), the right-hand side of this equation vanishes on V. 
Thus «py is a constant on V; in fact it will be zero on Y since it must 
vanish when w,, does. Similarly (vy will be zero on ¥”. However the 
vanishing of ;)y and (yy is the necessary and sufficient condition that 


Wad;cWdle = 0. oO 
Proposition 9.3.8 


Let (4, 8) be a stationary axisymmetric regular predictable space— 
time in which wjgy..Waie = 0, where w,, = K,, K,). Then at any point 
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in the exterior region J+(%-,. M)n J-(%+, M@) off the axis K = 0, 
h = w,,w™ is negative. On the horizons J-(.%+, @)n J +(9-, M) and 
J+(S-, M)n I-(Ft, M), his zero but W,» + 0 except on the axis. 


By proposition 9.3.3, K@ is non-zero in J+(.4-,./) n J-(£+, M). Let 
A be an S! which is a non-zero integral curve of the vector field K in 
J+(S-, MH) J-(£+,.W). Under the isometry §,, A can be moved into 
D*(f). As there are no closed non-spacelike curves in D+(S), A must 
be a spacelike curve, and hence K¢ must be spacelike in 


J+(.9-, M)n J-( I+, M) 


except on the axis where it is zero. Suppose there were some point p at 
which K¢ and A“ were both non-zero and in the same direction. As 
K¢ and K¢ commute, the integral curves of K* through p would 
coincide with those of A¢. However the former is closed while the 
latter is not. Thus K@ and K@ are linearly independent where they are 
non-zero. Thus w, is non-zero in J+(.%-, H) n J-(£+, M) except on 
the axis. 

The axis will be a two-dimensional surface. Let Y be the set 
J+(4-,.M)\n J-(I+,.@) —(the axis), and let ¥ be the quotient of Y 
by 6. As the integral curves of K@ are closed and spacelike in Y, the 
quotient 2 will be a Hausdorff manifold. On &, there will be a 
Lorentz metric h,, = 9,, —(K°K,) 1K, K,,. One can project the Killing 
vector K® by h,, to obtain a non-zero vector field h,, K® in & which is 
a Killing vector field for the metrich,,. The condition wi.,,,Wz, = Oin 
implies that in &, (Ke hete)iq hey K' = 0, where | denotes the co- 
variant derivative with respect to h. This is just the condition that 
there should exist a function on & such that K°h,, = —a&,,. The 
argument is then similar to that in proposition 9.3.4. One shows that 
if K,K,h® = 0 at a point peZ®, then the surface € = é|,, is a null 
surface in & with respect to the metric h. The function € on & induces 
a function £ on Y, with the property: £.,K* = 0. Thus é = €],, will be 
a null surface in -W with respect to the metric §. 

Suppose p corresponded to an integral curve A of K“ which did not 
lie on J-(.4+,. WH). Let ge.-W be a point of A. Then there would be 
a future-directed timelike curve y(v) from .4- through q to .¥*. If this 
curve intersected the axis, it could be deformed slightly to avoid it. 
One would then obtain a contradiction similar to that in proposition 
9.3.4. O 


10 


The initial singularity in the universe 


The expansion of the universe is in many ways similar to the collapse 
of a star, except that the sense of time is reversed. We shall show in 
this chapter that the conditions of theorems 2 and 3 seem to be satis- 
fied, indicating that there was a singularity at the beginning of the 
present expansion phase of the universe, and we discuss the implica- 
tions of space-time singularities. 

In §10.1 we show that past-directed closed trapped surfaces exist 
if the microwave background radiation in the universe has been 
partially thermalized by scattering, or alternatively if the Copernican 
assumption holds, i.e. we do not occupy a special position in the 
universe. In §10.2 we discuss the possible nature of the singularity 
and the breakdown of physical theory which occurs there. 


10.1 The expansion of the universe 


In §9.1 we showed that many stars would eventually gollapse and 
produce closed trapped surfaces. If one goes to a larger scale, one can 
view the expansion of the universe as the time reverse of a collapse. 
Thus one might expect that the conditions of theorem 2 would be 
satisfied in the reverse direction of time on a cosmological scale, pro- 
viding that the universe is in some sense sufficiently symmetrical, and 
contains a sufficient amount of matter to give rise to closed trapped 
surfaces. We shall give two arguments to show that this indeed seems 
to be the case. Both arguments are based on the observations of the 
microwave background, but the assumptions made are rather 
different. 

Observations of radio frequencies between 20cm and 1 mm indicate 
that there is a background whose spectrum (shown in figure 62 (i)) 
seems to be very close to that of a black body at 2.7°K (see, for 
example, Field (1969)). This background appears to be isotropic to 
within 0.2% (figure 62(ii); see, for example, Sciama (1971) and 
references given there for further discussion). The high degree of 
isotropy indicates that it cannot come from within our own galaxy (we 
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FIGURE 62 

(i) The spectrum of the microwave background radiation. The plotted 
points show the observed values of the ‘excess’ background radiation. The 
solid line is a Planck spectrum corresponding to a temperature of 2.7 °K. 

(a1) The isotropy of the microwave background radiation. The temperature 
distribution along the celestial equator is shown; more than two years of data 
have been averaged to obtain these points. 

From D. W. Sciama, Modern Cosmology, Cambridge University Press, 1971. 
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are not symmetrically placed in the plane of the galaxy) but must be of 
extragalactic origin. At these frequencies we can see discrete sources 
some of whose distances are known from other evidence to be of the 
order of 102?cm, so we know that the universe is transparent to this 
distance at these wavelengths. Thus radiation which is produced by 
sources at distances greater than 1027 cm must have propagated freely 
towards us for at least that distance. 
Possible explanations of the origin of the radiation are: 


(1) the radiation is black body radiation left over from a hot early 
stage of the universe; 

(2) the radiation is the result of superposition of a very large number 
of very distant unresolved discrete sources; 

(3) the radiation comes from intergalactic grains which thermalize 
other forms of radiation (perhaps infra-red). 


Of these explanations, (1) seems the most plausible. (2) seems im- 
probable, as there do not appear to be sufficient sources with the right 
sort of spectrum to produce an appreciable fraction of the observed 
radiation in this frequency range. Further, the small scale isotropy of 
the radiation implies that the number of discrete sources would have 
to be very large (of the order of the number of galaxies) and most 
galaxies do not seem to radiate appreciably in this region of the 
spectrum. (3) also seems unlikely, since the density of interstellar 
grains which would be needed is very large indeed. Although (1) seems 
the most probable, we will not base our arguments on it, since to do so 
would be to presuppose that the universe had a hot early stage. 

The first argument involves the assumption of the Copernican 
principle, that we do not oceupy a privileged position in space-time. 
We interpret this as implying that the microwave background radia- 
tion would appear equally isotropic to any observer whose velocity 
relative to nearby galaxies is small. In other words, we suppose there 
is an expanding timelike geodesic congruence (expanding because the 
galaxies are receding from each other, geodesic because they move 
under gravity alone with unit tangent vector J”, say), representing 
the average motion of the galaxies, relative to which the microwave 
radiation appears almost isotropic. From the Copernican principle it 
also follows that most of the microwave background has propagated 
freely towards us from a very long distance (~3 x 102?cm). This is 
because the contribution to the background arising from a spherical 
shell of thickness dr and radius 7 about us will be approximately 
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independent of r, since the amount produced in the shell will be pro- 
portional to 7? and the reduction of intensity due to distance will be 
inversely proportional to 7?. This will be the case until the redshift of 
the sources becomes appreciable, source evolution takes place, or 
curvature effects become significant. These effects will however only 
come in at a distance of the order of the Hubble radius, ~ 1028 em. 
Thus the bulk of the radiation will have travelled freely towards us 
from a distance S 10?7cm. From the fact that it remains isotropic 
travelling over such a long distance, we can conclude that on a large 
scale the metric of the universe is close to one of the Robertson— 
Walker metrics ($5.3). This follows from a result of Ehlers, Geren and 
Sachs (1968), which we will now describe. 

The microwave radiation can be described by a distribution func- 
tion f(u, p) (we-M@, pe T,) defined on the null vectors in T(-#), which 
can be regarded as the phase space of the photons. If the distribution 
function f(u, p) is exactly isotropic for an observer moving with four- 
velocity V7, it will have the form f(u, Z) where EZ = — V%p,. Since the 
radiation is freely propagating, f must obey the Liouville equation in 
T(M). This states that f is constant along integral curves of the 
horizontal vector field X, i.e. along any curve (u(v), p(v)) where u(v) is 
a null geodesic in -# and p = O/dv. 

Because f(u, Z) is non-negative and must tend to zero as H > co 
(since otherwise the energy density of radiation would be infinite), 
there must be an open interval of E for which éf/éE is non-zero. In 
this interval, one can express E as a function of f: H = g(u,f). Then 
Liouville’s equation implies that 


dE/dv = g.,p* (10.1) 
on each null geodesic, where one regards g as a function on -@ with 
ffixed. Also, g#/dy = —d(V%p,)/dv = —V, ,,p*p’. (10.2) 


One can decompose p* into a part along V“ and a part orthogonal 
to Vt: pt = E(V*+W*), where W°W,=1, W°V,=0. Then from 
(10.1) and (10.2), 
dg/dt+ $09 + (WV.+9;2) W*+9oa WW? = 0 
holds for all unit vectors W* orthogonal to V%, where dg/dt is the rate 
of change of g along the integral curves of V. Separating out spherical 
harmonics, Oo» = 0, (10.3.4) 
V+ (log 9); = aV,, (10.35) 
49 = —d (log g)/dt. (10.3) 
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Since we assumed that V, was zero, (10.36) shows that V, is orthogonal 
to the surfaces {g = constant}, and this implies that the vorticity Wg, 
is zero. As V4 = 0, Vj, 5; = 0. Thus one can write V, as the gradient of 
a function ¢: V, = —t 4. 

The energy-momentum tensor of the radiation will have the form 

Top = aly Vile a 3h, Jab> 

where 4, = | fE®° dE. Since the motion of the galaxies relative to the 
integral curves of V@ is small, their contribution to the energy— 
momentum tensor can be approximated by a smooth fluid with 
density pg, four-velocity V, and negligible pressure. It now follows 
that the geometry of the space-time is the same as that of a Robertson— 
Walker model. To see this, note that 


(Ve) aa 3(A( hae hl Ae 
a Ve oe eave zz 6.5+ Boa ee 
Multiplying this equation by h®, = g°?,+ V°V,, one finds 
Dial ye=— 5 hd, A 
The left-hand side vanishes by the field equations. Thus @ is constant 
on the surfaces of constant ¢ (which are also the surfaces of constant g). 
One can define a function S(t) from 6 by S’/S = 40; then the 
Raychaudhuri equation (4.26) takes the form 
38°" /S+474—A = 0, 
which implies that 4” = “4, +24p is also constant on the surfaces 
{t = constant}. From the definition of w, we see that the terms wv, and 
/tp are separately constant on these surfaces. 

The trace-free part of (4.27) shows that Cyj.qg V°V? = 0. The Gauss— 
Codacci equations ($2.7) now give for the Ricci tensor of the three- 
spaces {tf = constant} the formula 

RB, = h,° h,t hat as Ad ais 00 1 26 CP a, 
= 2h,,(—40? + 87+ A). 
However for a three-dimensional manifold, the Riemann tensor is 
completely determined by the Ricci tensor, as 
BP bed = Nav‘ ( a Bs, 2 $Rh,5) ca: 
This shows that each three-space {f = constant} is a three-space of 


constant curvature A(t) = 3(87+A—46?). Integrating the Ray- 
chaudhuri equation shows that 


K(t) = 4(874 + A—3S8"2/S82) = k/S2, (10.4) 


~ 
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where i is a constant. By normalizing S, one can set k = +1, Oor —1. 
The four-dimensional space-time manifold is the orthogonal product 
of these three-spaces and the ¢-line. Thus the metric can be written in 
comoving coordinates as 


ds? = — dt? + S2(t) dy’, 


where dy? is the metric of a three-space of constant curvature k. But 
this is just the metric of a Robertson—Walker space (see $5.3). 

We shall now show that in any Robertson—Walker space containing 
matter with positive energy density and \ = Othere isa closed trapped 
surface lying in any surface {¢ = constant}. To see this, we express dy? 


in the form dy? = dy? + f2(y) (d62-+ sin? dg?) 


where f(v) = siny. y or sinh y if k = +1, 0 or —1 respectively. Con- 
sider a two-sphere .7 of radius > lying in the surface ¢ = f). The two 
families of past-directed null geodesics orthogonal to 7 will intersect 
the surfaces {f = constant} in two two-spheres of radius 


t 
y= xo | dt/S(é). (10.5) 


The surface area of a two-sphere of radius y is 47S?(¢) f?(v). Thus both 
families of null geodesics will be converging into the past if, at t = fp, 


& (S%(t)f2(q)) > 0 


holds for both values of y given by (10.5). This will be the case if 


S'(to) S if F (Xo) 


S(to) S(to) f(Xo) 
But by (10.4), this holds if 
(Smp(to) S?(to) —)t > +f (Xo) /f(Xo)- 


This will be the case if S(f,) ¥> is taken to be greater than (3/879) for 
k = 0 or —1, and to be greater than min (,/(3/879), 37) fk = +1. 
An intuitive way of viewing this result is that at time ty a sphere of 
coordinate radius y, will contain a mass of the order of 77/19 S7(to) Xo°s 
and so will be within its Schwarzschild radius if S(fo) Vo is less than 
Sty S(to)? Xo, i.e. if S(to)Xo is greater than the order of ./(3/87 fo). 
We shall call ./(3/87/79) the Schwarzschild length of matter density /o. 
So far, we have assumed the microwave radiation is exactly iso- 
tropic. This is of course not the case; and this corresponds to the fact 
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that the universe is not exactly a Robertson—Walker space. However, 
the large scale structure of the universe should be close to that of 
a Robertson—Walker model, at least back to the time when the radia- 
tion was emitted or last scattered. (One can in fact use the deviations 
of the microwave radiation from exact isotropy to estimate how large 
the departures from a Robertson—Walker universe are.) For a suffi- 
ciently large sphere, the existence of local irregularities should not 
significantly affect the amount of matter in the sphere, and hence 
should not affect the existence of a closed trapped surface round us at 
the present time. 

The above argument did not depend on the spectrum of the micro- 
wave radiation, but it did involve the assumption of the Copernican 
principle. The argument we shall now give does not involve the 
Copernican principle, but does to a certain extent depend on the shape 
of the spectrum. We shall assume that the approximately black body 
nature of the spectrum and the high degree of small scale isotropy of 
the radiation indicate that it has been at least partially thermalized 
by repeated scattering. In other words, there must be enough matter 
on each past-directed null geodesic from us to cause the opacity to be 
high in that direction. We shall now show that this matter will be 
sufficient to make our past light cone reconverge. 

Consider a point p representing us at the present time, and let W2 
be a past-directed unit vector parallel to our four-velocity. 

The affine parameter v on the past-directed null geodesics through p 
may be normalized by AW, = —1, where K = é/év is the tangent 
vector to the null geodesics. The expansion @ of these null geodesics 
will obey (4.35) with 6 = 0. Thus, providing R,,A¢K° > 0, 8 will be 
less than 2/v. It follows that at v = v, > vp, 


i "Rap KK? dv—2Iv, > 6, 


v 


so 8 will become negative if there is some ?’) such that 


v1 
| Ri, Ke’ dy > 2x. 


Vo 


Using the field equations with A = 0, this becomes 


OF e 

$0 | Sai SK? des, (10.6) 
Vo 

At centimetre wavelengths, the largest ratio of opacity to density for 

matter at reasonable densities is that given by Thomson scattering off 
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free electrons in ionized hydrogen. Thus the optical depth to a distance 
v will be less than 


| xececer,) ae, 
0 


where «x is the Thomson scattering Opacity per unit mass, p is the 
density of the matter, and V, is the local velocity of the gas. The 
redshift z of the matter is given by z = A@V,—1. Since no matter has 
been seen with significant blue-shifts, we shall assume K “VY, is always 
greater than one on our past light cone, out to an optical depth unity. 
As galaxies are observed at these wavelengths with redshifts of 0.3, 
most of the scattering must occur at redshifts greater than this. (In 
fact if quasars really are cosmological, the scattering must occur at 
redshifts greater than two.) With a Hubble constant of 100 Km/sec/ 
Mpc (~ 10 years), a redshift of 0.3 corresponds to a distance of 
about 3x 10°7°em. Taking this value for vp, the contribution to the 
integral (9.9) of the matter causing the scattering is 


UV 
3.7 x 10% I o(K, V2)? do, 
while the optical depth of the matter between v, and 2, is less than 


6.6 x 107 | ‘p(KV,) de. 


v. 
V% 
Since A°V, > 1, it can be seen that the inequality (10.6) will be satisfied 
at an optical depth of less than 0.2. If the optical depth of the universe 
was less than 1, one would not expect either an almost black body 
spectrum or such a high degree of small scale isotropy, unless there was 
a very large number of discrete sources which covered only a small 
fraction of the sky and each of which had a spectrum roughly the same 
as a 3°K black body but with much higher intensity. This seems rather 
unlikely. Thus we believe that the condition (4) (iii) of theorem 2 is 
satisfied, and so there should be a singularity somewhere in the 
universe provided the other conditions hold. 

Because of its generality, theorem 2 does not tell us whether the 
singularity will be in our past or in the future of our past. Although it 
might seem obvious that the singularity should be in our past, one can 
construct an example in which it is in the future: consider a Robertson— . 
Walker universe with & = + 1 which collapses to a singularity at some 
time t = t), and which asymptotically approaches an Einstein static 
universe for t->—oo. This satisfies the energy assumption, and con- 
tains points whose past light cones start reconverging (because they 
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meet up around the back). However the singularity is in the future. 
Of course this is a rather unreasonable example but it shows that one 
has to be careful. We shall therefore give an argument based on 
theorem 3 which indicates that the universe contains a singularity in 
our past, providing that the Copernican principle holds. Theorem 3 
is similar to theorem 2, but requires that all the past-directed timelike 
geodesics from a point shall start to reconverge, instead of all the null 
geodesics. This condition is not satisfied in the example given above, 
though it is there satisfied by the future-directed geodesics from any 
point. 

By an argument similar to that given above for the null geodesics, 
the convergence 6(s) of the past-directed timelike geodesics from a 
point p will be less than 


3 8 
the: | Ray VeVP ds, 
So So 


where s is proper distance along the geodesics. V = ¢/@s and s > 8». 
Let W be a past-directed timelike unit vector at p, and letc = — V¢W,|, 
(so c > 1). Then @ will become less than —c within a distance R,/c 


along any geodesic if there is some Ry, R, > R, > 0, such that 


R,Je 
R,, V7V° ds > ¢(3/Ry+e) (10.7) 
R,/e 
along that geodesic. Condition (3) of theorem 3 will then be satisfied 
with b = max (Ry, (3€)-). = 


To make (10.7) appear more similar to (10.6). we shall introduce an 
affine parameter v = s/c along the timelike geodesics; then (10.7) 


becomes 18h, 
VR | Ry, K*K® dv > 14+ 4Rye, (10.8) 


where K = é/év and A¢W,|,, = —1. We cannot verify this condition 
directly by observation as in the case of (10.6) because it refers to 
timelike geodesics. We therefore have to appeal to the arguments 
given in the first part of this section to show that the universe is close 
to a Robertson—-Walker universe model at least back to the time the 
microwave background radiation was last scattered. 

In a Robertson—Walker model, let W be the vector — é/ot. Along 
a past-directed timelike geodesic through p, 

d 


Fp Wa K%) = Wap KOK? 


1 
Gq (W*Ka)*— 1c%}. 
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Therefore, providing that dS/dt > 0, W,A«@ < — 1. However 
Wei = dijdy: 
thus for some ¢ > 0, (10.8) will be satisfied for every geodesic provided 
that there are times t,, tz with t, < ts < #, such that 
= 
3 
By the field equations with A = 0, 
Ray KOK? = 8n{(u + p) (W,K*)?—3(u—p)c}. 
Therefore, providing p > 0, 
ee, BOK? SS 4nu(W Kk)? 
Thus (10.9) will be satisfied if 


ts 
| Ry K¢K?( —W, Key at > 1, (10.9) 
ty 


t,—t, [ts 
2] 4rpedt > 1. (10,10) 


Assuming that the microwave radiation has a black body spectrum 
at 2.7° RK. its energy density is about 10-94 gm em~% at the present time. 
If this radiation is primaeval, its energy density will be proportional 
to S~4. Since S-1 = O(-4) ast tends to zero, one can see that (10.10) can 
be satisfied by taking ¢, to be }¢,, and ¢, to be sufficiently small. How 
small t, has to be depends on the detailed behaviour of S, which in turn 
depends on the density of matter in the universe. This is somewhat 
uncertain. but seems to lie between 10-31gmem~-? and 5 x 10-% om 
em~*. In the former case, ty will have to be such that S(¢,,)/S(t.) 2 30, 
and in the latter casé, S(t,)/S(tg) = 300. Since the microwave radiation 
seems to be all pervasive, any past-directed timelike geodesic must 
pass through it. Thus an estimate based on the Robertson—-Walker 
models should be a good approximation for its contribution to (10.10), 
provided that the radiation was not emitted more recently than fg, 
and provided that a Robertson—Walker model is a good approximation 
back that far. From the arguments at the beginning of this section, the 
latter should be the case provided that the radiation has propagated 
freely towards us since fy. However there may be ionized intergalactic 
gas present with a density as high as 5 x 10-* gm cm, in which case 
the radiation could be last scattered at a time ¢ such that S(#,)/S(f) ~ 5. 
The optical depth back to a time ¢ is 


tp 
[ Kuga, Ae, (10.11) 


where x is at most 0.5 if is measured in gm cm~° and ¢ in cm. 
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As before, there can be no significant opacity back tot = £, — 10'’ see, 
since we see objects at distances of at least 3 x 107cm. Taking ¢; to 
have this value, we see that the gas density will cause (10.11) to be 
satisfied for a value of t, corresponding to an optical depth of at most 
0.5. 

Thus the position is as follows. We assume the Copernican principle, 
and that the microwave radiation has been emitted either before a 
time t, such that S(t,,)/S(t,) ~ 300, or before the time corresponding to 
the optical depth of the universe being unity, if this is less than f,. In 
the former case, condition (2) of theorem 3 will be satisfied by the 
radiation density, and in the latter case by the gas density. Thus if the 
usual energy conditions and causality conditions hold, we can con- 
clude that there should be a singularity in our past (i.e. there should be 
a past-directed non-spacelike geodesic from us which is incomplete). 

Suppose one takes a spacelike surface which intersects our past 
light cone and takes a number of points on that surface; can one say 
that there is a singularity in each of their pasts? This will be the case 
if the universe is sufficiently homogeneous and isotropic in the past to 
converge all the past-directed timelike geodesics from these points. 
In view of the close connection between the convergence of timelike 
geodesics and closed trapped surfaces, we would expect this to be the 
case if the universe is homogeneous and isotropic at that time on the 
scale of the Schwarzschild length (3/87). 

We have direct evidence of the homogeneity of the universe in our 
past from the measurements of Penzias, Schram] and Wilson (1969), 
who found that the intensity of the microwave background is isotropic 
to within 4 % for a beam width of 1.4 x 10-8 square degrees. Assuming 
that the microwave radiation has not been emitted since a surface in 
our past corresponding to optical depth unity, the observed intensity 
will be proportional to T4/(1+2)4 where T is the effective temperature 
of the observed point on the surface and z is its redshift. Variations in 
the observed intensity can arise in four ways: 

(1) by a Doppler shift caused by our own motion relative to the 
black body radiation (Sciama (1967), Stewart and Sciama (1967)); 

(2) by variations in the gravitational redshift caused by inhomo- 
geneities in the distribution of matter between us and the surface 
(Sachs and Wolfe (1967), Rees and Sciama (1968)); 

(3) by Doppler shifts caused by local velocity disturbances of the 
matter at the surface; and 

(4) by variations of the effective temperature of the surface. 
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(In fact the division between (1), (2) and (3) depends on the standard 
of reference and has heuristic value only.) Thus the observations indi- 
cate that irregularities in the temperature with an angular size of 3’ of 
are have relative amplitudes of less than 1%, and that there are no 
local fluctuations of the velocity of the matter, on the same scale, of 
greater than 1 °% of the velocity of light. A region on the surface which 
had an angular diameter 3’ of are would correspond to a region which 
had a diameter now of about 10° light years. If the surface of optical 
depth unity is at a redshift of about 1000 (this is the most it could be), 
the Schwarzschild length at that time would correspond to a region 
whose present diameter was about 3 x 108 light years. Thus it would 
seem that every point on the surface of optical depth unity should 
have a singularity in its past. 

More indirect evidence on the degree of homogeneity of the universe 
in the early stages comes from the fact that observations of the helium 
content of a number of objects agree with calculations of helium pro- 
duction by Peebles (1966), and Wagoner, Fowler and Hoyle (1968), 
who assumed the universe was homogeneous and isotropic at least 
back to a temperature of about 10°°K. On the other hand calculations 
of anisotropic models have shown that in these models very different 
amounts of helium are produced. Thus if one accepts that there is a 
fairly uniform density of helium in the universe (there are some doubts 
about this), and that this helium was produced in the early stages of 
the universe, one can conclude that the universe was effectively 
isotropic and hence homogeneous when the temperature was 10° °K. 
One would therefore expect a singularity to occur in the past of each 
point at this time. 

Misner (1968) has shown that if the temperature reaches 2 x 10!°°K 
a large viscosity arises from collisions between electrons and neutrinos. 
This viscosity would damp out inhomogeneities whose lengths corre- 
spond to present values of 100 light years, and reduce anisotropy 
toa comparatively small value. Thus if one accepts this as the explana- 
tion for the present isotropy of the universe (and it isa very attractive 
one). one would conclude that there should be a singularity in the past 
of every point when the temperature was about 101°°K. 


10.2 The nature and implications of singularities 


One might hope to learn something about the nature of the singu- 
larities that are likely to occur by studying exact solutions with 
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singularities. However although we have shown that the occurrence 
of a singularity is not prevented by small perturbations of the initial 
conditions, it is not clear that the nature of the singularity which 
occurs will be similarly stable. Although we have shown in §7.4 that 
the Cauchy problem is stable under small perturbations of the initial 
conditions, this stability applies only to compact regions of the 
Cauchy development, and a region containing a singularity is non- 
compact unless the singularity corresponds to imprisoned incom- 
pleteness. In fact we can give an example where the nature of the 
singularity is not stable. Consider a uniform spherically symmetric 
cloud of dust collapsing to a singularity. The metric inside the dust will 
be similar to that of part of a Robertson—Walker universe, while that 
outside will be the Schwarzschild metric. Both inside and outside the 
dust, the singularity will be spacelike (figure 63 (i)). Suppose now 
one adds a small electric charge density to the dust. The metric outside 
the dust now becomes part of the Reissner-Nordstrém solution for 
ce? < m* (figure 63 (1i)). There will be a singularity inside the dust. as 
a sufficiently small charge density will not prevent the occurrence of 
infinite density. The nature of the singularity inside the dust will 
presumably depend on the charge distribution. However the im- 
portant point is that once the surface of the dust has passed a point 
p inside r = 7,, whatever happens inside the dust cannot affect the 
portion sq of the timelike singularity. 

If one now increases the charge density so that it becomes greater 
than the matter clensity, it is possible for the cloud to pass through the 
two horizons at r=r7, and r=r_ and to re-expand into another 
universe without any singularity occurring inside the dust, although 
there is a timelike singularity outside the dust (J.M. Bardeen. un- 
published), as indeed there ought to be by theorem 2 (see figure 
63 (iii)). 

This example is very important as it shows that there can be time- 
like singularities, that the matter can avoid hitting the singularities, 
and that it can pass through a ‘wormhole’ into another region of 
space-time or into another part of the same space-time region. Of 
course one would not expect to have such a charge density on a col- 
lapsing star, but since the Kerr solution is so similar to the Reissner— 
Nordstr6m solution one might expect that angular momentum could 
produce a similar wormhole. One might speculate therefore that prior 
to the present expansion phase of the universe there was a contraction 
phase in which local nhomogeneities grew large and isolated singu- 
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FIGURE 63 
(i) Collapse of a spherical dust cloud. 
(ii) Collapse of a charged dust cloud, where the charge is too small to prevent 
the occurrence of a singularity in the dust. 
(i111) Collapse of a charged dust cloud, where the charge is large enough to 
prevent the occurrence of a singularity in the dust cloud; the singularity occurs 
outside the dust, which bounces and re-expands into a second asymptotically 


flat space. 
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larities occurred, most of the matter avoiding the singularities and 
re-expanding to give the present observed universe. 

The fact that singularities must occur within the past of every point 
at an early time when the density was high, places limits on the 
separation of the singularities. It might be that the set of geodesics 
which hit these singularities (i.e. which are incomplete) was a set of 
measure zero. Then one might argue that the singularities would be 
physically insignificant. However this would not be the case because 
the existence of such singularities would produce a Cauchy horizon 
and hence a breakdown of one’s ability to predict the future. In fact 
this could provide a way of overcoming the entropy problem in an 
oscillating world model since at each cycle the singularity could inject 
negative entropy. 

So far, we have been exploring the mathematical consequences of 
taking a Lorentz manifold as the model for space-time, and requiring 
that the Einstein field equations (with A = 0) hold. We have shown 
that according to this theory, there should be singularities in our past 
associated with the collapse of the universe, and singularities in the 
future associated with the collapse of stars. If A is negative, the above 
conclusions would be unaffected. If A is positive, observations of the 
rate of change of expansion of the universe (Sandage, (1961, 1968)) 
indicate that A cannot be greater than 3 x 10-5 cm-?. This is equiva- 
lent to a negative energy density of 3 x 10-7’ gm cm-*. Sucha value of A 
could have an effect on the expansion of the whole universe, but it 
would be completely swamped by the positive matter density in a 
collapsing star. Thus it does not seem that a A term can enable us to 
avoid facing the problem of singularities. 

It may be that General Relativity does not provide a correct 
description of the universe. So far it has only been tested in situations 
in which departures from flat space are very small (radii of curvature 
of the order of 10! cm), Thus it is a tremendous extrapolation to apply 
it to situations like collapsing stars where the radius of curvature 
becomes less than 10°cm. On the other hand the theorems on singu- 
larities did not depend on the full Einstein equations but only on the 
property that A,,A°K® was non-negative for any non-spacelike 
vector A*; thus they would apply also to any modification of General 
Relativity (such as the Brans—Dicke theory) in which gravity is always 
attractive. 

It seems to be a good principle that the prediction of a singularity 
by a physical theory indicates that the theory has broken down, i.e. it 
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no longer provides a correct description of observations. The question 
is: when does General Relativity break down? One would expect it to 
break down anyway when quantum gravitational effects become im- 
portant; from dimensional arguments it seems that this should not 
happen until the radius of curvature becomes of the order of 10-33. em. 
This would correspond to a density of 10°4gmem~-3. However one 
might question whether a Lorentz manifold is an appropriate model for 
space-time on length scales of this order. So far experiments have 
shown that assuming a manifold structure for lengths greater than 
10-) cm gives predictions in agreement with observations (Foley et al. 
(1967)), but it may be that a breakdown occurs for lengths between 
10-* and 10-*8cm. A radius of 10-1 em corresponds to a density of 
10°°¢m cm? which for all practical purposes could be regarded as 
a singularity. Thus maybe one should construct a surface by Schmidt’s 
procedure (§8.3) around regions where the radius of curvature is less 
than, say, 10-* cm. On our side of this surface a manifold picture of 
space-time would be appropriate, but on the other side an as yet 
unknown quantum description would be necessary. Matter crossing 
the surface could be thought of as entering or leaving the universe, and 
there would be no reason why that entering should balance that 
leaving. 

In any case, the singularity theorems indicate that the General 
Theory of Relativity predicts that gravitational fields should become 
extremely large. That this happened in the past is supported by the 
existence and black body character of the microwave background 
radiation, since this suggests that the universe had a very hot dense 
early phase. 

The theorems on the existence of singularities could possibly be 
refined somewhat, but on our view they are already adequate. How- 
ever they tell us very little about the nature of the singularities. One 
would like to know what kind of singularities could occur in generic 
situations in General Relativity. A possible way of approaching this 
would be to refine the power series expansion technique of Lifshitz 
and Khalatnikov, and to clarify its validity. It may also be that there 
is some connection between the singularities studied in General 
Relativity and those studied in other branches of physics (cf. for 
instance, Thom’s theory of elementary catastrophes (1969)). Alterna- 
tively one might try to proceed by brute force, integrating the 
Einstein equations numerically on a computer. However this will 
probably have to wait for a new generation of computers. One would 
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like to know also whether the singularities produced by collapse from 
a non-singular asymptotically flat situation would be naked, i.e. 
visible from infinity, or whether they would be hidden behind an 
event horizon. 

The other main problem is to formulate a quantum theory of 
space-time which will be applicable to strong fields. Such a theory 
might be based on a manifold, or might allow changes of topology. 
Some preliminary attempts in this line have been made by de Witt 
(1967), Misner (1969, 1971), Penrose (see Penrose and MacCallum 
(1972)), Wheeler (1968), and others. However the interpretation of 
a quantum theory of space-time, and its relation to singularities, are 
still very obscure. 

Speculation and discussion on the subject of this book is not new. 
Laplace essentially predicted the existence of black holes: ‘Other 
stars have suddenly appeared and then disappeared after having 
shone for several months with the most brilliant splendour... All these 
stars... do not change their place during their appearance. Therefore 
there exists, in the immensity of space, opaque bodies as considerable 
in magnitude, and perhaps equally as numerous as the stars.’ (M. Le 
Marquis de Laplace: ‘The system of the world’. Translated by Rev. H. 
Harte. Dublin, 1830, Vol. 2, p. 335.) As we have seen, our present 
understanding of the situation is remarkably similar. 

The creation of the Universe out of nothing has been argued, 
indecisively, from early times; see for example Kant’s first Antinomy 
of Pure Reason and comments on it (Smart (1964), pp. 117-23 and 
145-59; North (1965), pp. 389-406). The results we have obtained 
support the idea that the universe began a finite time ago. However 
the actual point of creation, the singularity, is outside the scope of 
presently known laws of physics. 


Appendix A 


Translation of an essay by 
Peter Simon Laplace? 


Proof of the theorem, that the attractive force of a heavenly body could be 
so large, that light could not flow out of it.t 


(1) If v is the velocity, ¢ the time and s space which is uniformly 
moving during this time, then, as is well known, v = s/t. 


(2) Ifthe motion is not uniform, to obtain the value of v at any instant 
one has to divide the elapsed space ds and this time interval dt into 
each other, namely v = ds/dt, since the velocity over an infinitely small 
interval is constant and thus the motion can be taken as uniform. 


(3) A continuously working force will strive to change the velocity. 
This change of the velocity, namely dv, is therefore the most natural 
measure of the force. But as any force will produce double the effect 
in double the time, so we must divide the change in velocity dv by the 
time dt in which it is brought about by the force P, and one thus 
obtains a general expression for the force P, namely 


ds 
dv dt 
ieee oat 
Now if dé is constant, 
5 ds_d.ds_ dds | 
ad Te ak’ 
: dds 
accordingly P = FE: 


+ Allgemeine geographische Ephemeriden, verfasset von Einer Gesellschaft Gelehrten. 
8vo Weimer, tv, Bd 1 St. 1799. Ed. F. X. von Zach. We should like to thank D. W. 
Dewhirst for providing us with this reference. 

+ This theorem, that a luminous body in the universe of the same density as the earth, 
whose diameter is 250 times larger than that of the sun, can by its attractive power 
prevent its light rays from reaching us, and that consequently the largest bodies in 
the universe could remain invisible to us, has been stated by Laplace in his 
Exposition of the System of the World, Part 11, p. 305, without proof. Here is the 
proof. Cf. A.G.E. May 1798, p. 603. v. Z. 
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(4) Let the attractive force ofa body = M;a second body, for example 
a particle of light, finds itself at distance r; the action of the force M on 
this light particle will be — M/rr; the negative sign occurs because the 
action of M is opposite to the motion of the light. 


(5) Now according to (3) this force also equals ddr/di?, hence 


M ddr y 
ene a Me 
dr dd 
Multiplying by dr, a = — Marr; 
1dr? 
integrating, sak = 04+ Mr 


where C is a constant quantity, or 


Now by (2) dr/dt is the velocity v, accordingly 
p= 20-42 


holds, where v is the velocity of the light particle at the distance r. 


(6) To now determine the constant C, let R be the xadius of the 
attracting body, and a the velocity of the light at the distance FR, hence 
on the surface of the attracting body; then one obtains from (5) 
a? = 204+2M/R, therefore 2C = a*—2M/R. Substituting this in the 
previous equation gives 

2M 2M 
RT 


v2 = QQ? 


(7) Let R’ be the radius of another attracting body, its attractive 
power be 7M, and the velocity of the light at a distance r be v’, then 
according to the equation in (6) 
2M wM 

Sg re 
(8) If one makes r infinitely large, the last term in the previous 
equation vanishes and one obtains 

21M 
ae 


Be 
a 


ypr=a 


ESSAY BY LAPLACE 367 


The distance of the fixed stars is so large, that this assumption is 
justified. 


(9) Let the attractive power of the second body be so large that light 

cannot escape from it; this can be expressed analytically in the 

following way: the velocity v’ of the light is equal to zero. Putting this 

value of v’ in the equation (8) for v’, gives an equation from which the 

mass 7.1 for which this occurs can be derived. One has therefore 
2M _ 2M 


Or ee or a? = —r- 


(10) To determine a, let the first attracting body be the sun; then a is 
the velocity of the sun’s light on the surface of the sun. The attractive 
power of the sun is however so small in comparison with the velocity 
of light, that one can take this velocity as uniform. From the pheno- 
mena of aberration it appears that the earth travels 20” in its path 
while the light travels from the sun to the earth, accordingly: let V be 
the average velocity of the earth in its orbit, then one hasa: V = radius 
(expressed in seconds) : 20”} = 1: tang. 20". 


(11) My assumption made in Expos. du Syst. du Monde, Part 11, p. 305, 
is R’ = 250R. Now the mass changes as the volume of the attracting 
body multiplied by its density; the volume, as the cube of the radius; 
accordingly the mass as the cube of the radius multiplied by the 
density. Let the densit¥ of the sun = 1; that of the second body = p; 


unen Mae lhe ph = 1h*:p250°h* 
or 12 = 177(250)" 
or t—1(250)27. 


(12) One substitutes the values of 1 and f’ in the equation 
a? = 2i1M/R’, and thus obtains 


2(250)3pM M 

2 = iy 

a ~950R ata R 
aR 


e: P= R50 F 


(13) To obtain p, one must still determine M. The force M of the sun 
is equal at a distance D to M/D?. Let D be the average distance of the 
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earth, V the average velocity of the earth; then this force is also equal 
to V2/D (see Lande’s Astronomy, 111, §3539). Hence M/D? = V?/D or 
M = V2D. Substituting this in the equation (12) for p gives 


_  @R 8 (s)’ R 
P* 2(2502V2D  (1000)2\V (5 


a vel. of light 1 : 
V vel.ofearth tang. 20"} eoconming to), 


R bsol di 
absolute ruEene = tan average apparent radius of ©. 


D average distance of © 


Hence 
= tang. 16’ 2” 
P= © (1000 tang. 20’) 


from which p is approximately 4, or as large as the density of the earth. 


Appendix B 


Spherically symmetric solutions and 
Birkhoff’s theorem 


We wish to consider Einstein’s equations in the case of a spherically 
symmetric space-time. One might regard the essential feature of a 
spherically symmetric space-time as the existence of a world-line # 
such that the space-time is spherically symmetric about Y. Then all 
points on each spacelike two-sphere Y centred on any point p of YL, 
defined by going a constant distance d along all geodesics through p 
orthogonal to #, are equivalent. If one permutes directions at p by 
use of the orthogonal group SO(3) leaving ¥ invariant, the space-time 
is, by definition, unchanged, and the corresponding points of 4, are 
mapped into themselves; so the space-time admits the group SO(3) 
as a group of isometries, with the orbits of the group the spheres S,. 
(There could be particular values of d such that the surface “ was 
just a point p’; then p’ would be another centre of symmetry. There 
can be at most two points (p’ and 9 itself) related in this way.) 

However, there might not exist a world-line like Y in some of the 
space—times one would wish to regard as spherically symmetric. In the 
Schwarzschild and Reissner—Nordstrém solutions, for example, space— 
time is singular at the points for which r = 0, which might otherwise 
have been centres of symmetry. We shall therefore take the existence 
of the group SO(3) of isometries acting on two-surfaces like % as the 
characteristic feature of a spherically symmetric space-time. Thus we 
shall say that space-time is spherically symmetric if it admits the 
group SO(3) as a group of isometries, with the group orbits spacelike 
two-surfaces. These orbits are then necessarily two-surfaces of con- 
stant positive curvature. 

For each point q in any orbit S (q), there is a one-dimensional sub- 
group J, of isometries which leaves q invariant (when there is a central 
axis F, this is the group of rotations about p which leaves the geodesic 
pq invariant). The set @(q) of all geodesics orthogonal to /(q) at q 
locally form a two-surface left invariant by J, (since J,, which permutes 
directions in (q) about q, leaves invariant directions perpendicular 
to £(q)). At any other point r of @(q), J, again permutes directions 

[ 369 ] 


370 APPENDIX B 


orthogonal to ¢(q), as it leaves @(q) invariant; since J, must operate 
in the group orbit Y(r) through 7, this orbit is orthogonal to @(q). 
Thus (Schmidt (1967)) the group orbits are orthogonal to the 
surfaces @. Further these surfaces define locally a one-one map 
between the group orbits, where the image f(q) of g in Y(r) is the 
intersection of ¢(q) and Y(r). Since this map is invariant under J,, 
vectors of equal magnitude in Y(q) at g are mapped into vectors of 
equal magnitude in .(r) at f(g); and since all the points of /(q) are 
equivalent, the same magnitude multiplication factor occurs for the 
maps of vectors from any point in “(q) to its image in A(r). Thus 
(Schmidt (1967)) the orthogonal surfaces € map the trajectories # 
conformally onto each other. 

If one chooses coordinates {t,r, 6, ¢} so that the group orbits S are 
the surfaces {t,r = constant} and the orthogonal surfaces @ are the 
surfaces {@,¢ = constant}, it now follows that the metric takes the 
form ds? = dr*(t,r)+ Y*(é, r) dQO?(6, 6), where dr? is an indefinite two- 
surface and d{)? is a surface of positive constant curvature. If one 
further chooses the functions ¢, r so that the curves {¢ = constant}, 
{r = constant} are orthogonal in the two-surfaces @ (cf. Bergmann, 
Cahen and Komar (1965)), one can write the metric in the form 

ds? =, Figa ,r) dr? + Y(t, r) (d6? + sin? 6 d¢?). (A 1) 
(Note that this still leaves the freedom to choose arbitrarily either 
r or t in these surfaces.) 

Let an observer moving along the ¢-lines measure an energy density 
#, an isotropic pressure p, an energy flux g, and no anisotropic pres- 
sures. Then the field equations for the metric (A 1) may be written 
in the form 


“(eRe an 
eet (—xy) gp) PARP HZ) (AM 
oop = y+ 28 (PL) 2(E) re BLE (EY ay 

sin 09) = 4(-)'—w(rX) - (5) 


i \2 F 2 Y'F’ 

Se ae a 

(=) + (F) - (A5) 
where ‘ denotes @/ér and * denotes @/ét. 
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We first consider the empty space field equations R,, = 0; this means 
that in (A2)-(A5): we must set ~ = p =q = 0. The local solution 
depends on the nature of the surfaces {Y = constant}; these surfaces 
may be timelike, spacelike or null, or they may not be defined (if Y is 
constant). In the exceptional case when Y:¥,, = 0on some open set Y 
(this includes the case when Y is constant), 


= FY (A6) 


holds in Y%. However when (A 6) holds, the value of Y°’ determined 
by (A 2) is inconsistent with (A 3). Thus we may consider some point 
p where Y:¢¥., < Oor Y'°¥., > 0; the same inequality must hold in 
some open neighbourhood Y of p. 

Consider first the situation when Y:“Y,, < 0. Then the surfaces 
{Y = constant} are timelike in 7, and one can choose Y to be the 
coordinate r. (Then 7 is an area coordinate, as the area of the two- 
surfaces {r,t = constant} is 47r?.) Thus Y° = 0, Y’ = tand (A 2) shows 
that X° = 0. Further (A 4) shows that (F’/F')’ = 0, so one can choose 
a new time coordinate ¢’(t) in such a way as to set F = F(r). Then one 
has F = Fir), X = XA(r), Y =7; the solution is necessarily static. 
Equation (A 3) now shows d(r/X?)/dr = 1, so solutions are of the form 
AX? = (1—2m/r)-1 where 2m is a constant of integration. Equation 
(A 4) can be integrated, with a suitable choice of a constant of integra- 
tion, to give #? = X?, and then (A5) is identically satisfied. With 
these forms of F and X the metric (A 1) becomes 

7 


i 2 
als? == -(i-=) pee Fae tgs): (A 7) 
r i 2m 
(=) 
this is the Schwarzschild metric for r > 2m. 

Now suppose Y:“Y., > 0. Then the surfaces {Y = constant} are 
spacelike in Y%, and one can choose Y to be the coordinate t. Then 
Y'= 1, Y’= Oand (A 2) shows F’ = 0. One can choose the r-coordinate 
so that X = X(t); then F = F(t), X = X(t), Y =¢ and the solution is 
spatially homogeneous. Now (A 4) and (A 5) can be integrated to find 
the solution 

dé? 2m ’ 
ds? = —-—————- + |—- - i) dr? + #2(d6?+sin?6d¢7). (A 8) 
2m _ i) t 
t 
This is part of the Schwarzschild solution inside the Schwarzschild 
radius, for the transformation tr’, rt’ transforms this metric into 
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the form (A 7) withr’ < 2m. Finally, ifthe surfaces {Y = constant} are 
spacelike in some part of an open set V and timelike in another part, 
one can obtain solutions (A 8) and (A 7) in these parts, and then join 
them together across the surfaces where Y:*Y,,=0 as in §5.5, 

obtaining a part of the maximal Schwarzschild sallanton which lies 
in ¥Y. Thus we have proved Birkhoff’s theorem: any C? solution of 
Einstein’s empty space equations which is spherically symmetric in 
an open set V, is locally equivalent to part of the maximally extended 
Schwarzschild solution in WV. (This is true even if the space is O°, 
piecewise C1; see Bergmann, Cahen and Komar (19635).) 

We now consider spherically symmetric static perfect fluid solutions. 
Then one can find coordinates {t,r, 0, ¢} such that the metric has the 
form (A 1), the fluid moves along the f-lines (so g = 0), and F = F(r), 
X = X(r), Y = Y(r). The field equations (A 3), (A 4) now show that 
if Y’ = 0, then ~+p = 0; we exclude this as being unreasonable for 
a physical fluid, so we assume Y’ + 0. One may therefore again choose 
Y as the coordinate 7; the metric then has the form 


dé? 
ds? = — Fo) + X?(r) dr? + 7?(d6? + sin? 6 d¢?). (A 9) 
The contracted Bianchi identities T”., = 0 now shows 
p'—~ (“+ p) F'/F = 0; (A 10) 
(A 5) is identically satisfied if (A3), (A4) and (A10) are satisfied. 
Equation (A 3) can be directly integrated to show . 
2M 
xe (128) ; (A 11) 
ir 
where Mir) = a | dr, 
0 


and the boundary condition X(0) = 1 has been used (i.e. the fluid 
sphere has a regular centre). With (A 10), (A 11), equation (A 4) takes 


the form dp me (4+ >p) (Mf + 4p) ree 
dr r(r—2M) eae! 


which determines p as a function of *, if the equation of state is known. 
Finally (A 10) shows that 
pr) 
Lg) cexp| ac 
(0) + p 
where C is a constant. Equations (A 11)-(A 13) determine the metric 


inside the fluid sphere, i.e. up to the value r) of r representing the 
surface of the fluid. 


(A 13) 
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U AUB, union of A and B 
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> AcB,B>A, Ais contained in B 
— <A-—B, B subtracted from A 
€ wxe€A,isa member of A 

@ the empty set 


Maps 


go: U>YV, > maps pe Y to d(p)eV 

P(Z%) image of Y under ¢ 

g-1 inverse map to ¢ 
fog composition, g followed by f 

d»,~* mappings of tensors induced by map ¢, 22-4 


Topology 
A closure of A 


A’ boundary of A, 183 
int A interior of A, 209 


Differentiability 
C°,Cr,Ct-,C®  differentiability conditions, 11 
Manifolds 


M n-dimensional manifold, 11 


(Wa, Pa) 
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0M boundary of -4, 12 

R” Euclidean n-dimensional space, 11 
4R” lower half x! < 0 of R”, 11 

S” n-sphere, 13 

x Cartesian product, 15 


Tensors 
(2/ét),, X vectors, 15 
w,df one-forms, 16, 17 
<w,X) scalar product of vector and one-form, 16 
{E,},{E% dual bases of vectors and one-forms, 16, 17 
T%%, ,, components of tensor T of type (r,s), 17-19 
@® tensor product, 18 
A skew product, 21 
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Derivatives and connection 
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d_ exterior derivative, 17, 25 

LyY, [X, Y] Lie derivative of Y with respect to X, 27-8 
V, Vx, Tip:- covariant derivative, 30-2 
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I‘, connection components, 31 


exp exponential map, 33 
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(.4,8) manifold .@with metric g and Christoffel connection 
n volume element, 48 
Rosca Riemann tensor, 35 
R» Ricci tensor, 36 
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# curvature scalar, 41 
Crvea Weyl tensor, 41 
O(p,q) orthogonal group leaving metric G_,, invariant, 52 
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O() bundle of orthonormal frames, 52 


Space-time 


Space-time is a 4-dimensional Riemannian space (4,8) with 
metric normal form diag (+1, +1, +1, —1). Local coordinates are 
chosen to be (21, x?, x3, x4). 
T, energy momentum tensor of matter, 61 
Yipt8....¢ matter fields, 60 
L Lagrangian, 64 
Einstein’s field equations take the form 
Rap — 22RGan t+ AGan = 87 Ty; 
where A is the cosmological constant. 
(YF, w) is an initial data set, 233 


Timelike curves 
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D,/és Fermi derivative, 80-1 
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W*, W,4, 0 vorticity, 82-4 
O.p,0 shear, 83-4 


Null geodesics 
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Boundary of space-time 
M* = M\)A where Ais the c-boundary, 220 
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generalized, 259, 278, 291 
Alexandrov topology, 196 
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ies 


space—times, 
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tensors, 
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bifurcation 
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Birkhoff’s theorem, 372 
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boundary 
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tangent bundle, 51, 54 


61, 53, 54, 174, 


e-boundary, 217-21, 224-5 
canonical form, 48 
Carter’s theorem, 331 
Cartesian product, 15 
Cauchy data, 147, 231-3, 254 
Cauchy development, 6, 94, 119, 147, 
201-6, 209-11, 217, 228 
local existence, 248, 255 
global existence, 251, 255 
stability, 253, 255, 301, 310 
Cauchy horizon, 202-4, 265, 287, 362 
examples, 120, 133, 159, 178, 203, 205, 
287 
Cauchy problem, 60, 226-54 
Cauchy sequences, 257, 282 
Cauchy surface, 205, 211, 212, 263, 265, 
274, 287, 313 
examples, 119, 125, 142, 154 
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Cauchy surface (cont.) 
lack of, 133, 159, 178, 205, 206 
partial Cauchy surface, 204, 217, 
301-2, 310-20, 323 
causal boundary of space-time, 217-21, 
221-5; see also conformal structure 
causal future (past), J+(J—), 183 
causal structure, 6, 127-30, 180-225 
causally simple set, 188, 206, 207, 223 
local causality neighbourhood, 195 
causality conditions 
local causality, 60 
chronology condition, 189 
causality condition, 190 
future, past distinguishing conditions, 
192 
strong causality condition, 192 
stable causality condition, 198 
causality violations, 6, 162, 164, 170, 175, 
189, 492, 197 
and singularity theorems, 272 
caustics, 120, 132-3, 170; see 
conjugate points 
charged, scalar field, 68 
chart, 11 
Christoffel relations, 40 
chronological future (past), I+(I-), 182, 
217 
chronology condition, 189, 192, 194, 266 
violating set, 189 
cigar singularity, 144 
closed trapped surface, 2, 262, 263, 266 
examples, 155, 161 
in asymptotically flat spaces, 311, 319 
outer trapped surface, 319; marginally 
outer trapped surface, 321 
outside collapsing star, 301, 308 
in expanding universe, 353-8 
Codacci’s equation, 47, 232, 352 
collapse of star, 3, 8, 300-23, 360 
compact space-time, 40, 189 
compact space sections, 272-5 
completeness conditions 
inextendibility, 58 
metric completeness, 257 
geodesic completeness, 257 
b-completeness, 259, 278-283 
completion by Cauchy sequences, 282, 
283 
components of connection, 31 
components of tensor, 19 
of p-form, 21 
conformal curvature tensor, 41, 85; see 
Wey] tensor 
conformal metrics, 42, 60, 63, 180, 222 
conformal structure of infinity and 
singularities 
c-boundary, 217—21 


also 


INDEX 


examples, 122, 127, 132, 141, 145, 154, 
158, 160, 165, 177 
in asymptotically flat spaces, 222-4 
horizons, 128-30 
conformally flat theory, 75-6 
congruence of curves, 69 
conjugate points, 4, 5, 267 
on timelike geodesics, 97, 98, 111, 100, 
Up, Bile 
on null geodesics, 100, 101, 115, 102, 
116 
connection, 30, 31, 34, 40, 41, 59, 63 
and bundles over .#, 53-5, 277 
on hypersurface, 46 
conservation 
of energy and momentum, 61, 62,67, 73 
of matter, theorem, 94, 298 
of vorticity, 83-4 
constraint equations, 232 
continuity conditions 
for map, 11 
of space-time, 57, 284 
contraction of tensor, 19 
contracted Bianchi identities, 43 
convergence of curves, see expansion 
convergence of fields 
weak, 243 
strong, 243 
convex normal neighbourhood, 34, 60, 
103, 105, 184 
local causality neighbourhood, 195 
coordinates, 12 
normal coordinates, 34, 41 
coordinate singularities, 118, 133, 150, 
156, 163, 171, 
Copernican principle, 
350, 356, 358 ; 
cosmological constant, 73, 95, 124, 137, 
139, 168, 362 
cosmological models 
isotropic, 134-42 
spatially homogeneous, 142-9 
covariant derivatives, 31-5, 40, 59 
covering spaces, 181, 204-5, 273, 293 
cross-section of a bundle, 52 
curvature tensor, 35, 36, 41 
identities, 36, 42, 43 
of hypersurface, 47 
physical significance, 78-116 
curve, 15 
geodesic, 33, 63, 103-16, 213-17 
non-spacelike, 105, 112, 184, 185, 207, 
213 
null, 86-8 
timelike, 78-86, 103, 182, 184, 213-17 


134, 135, 142, 


de Sitter space-time, 124-31 
density of matter in universe, 137, 357 


INDEX 


development, 228, 248, 251, 253 
existence, 246-9 
deviation equation 
timelike curves, 80 
null geodesics, 87 
diffeomorphism, 22, 56, 74, 227 
differentiability conditions, 11, 12 
and singularities, 284-7 
of initial data, 251 
of space-time, 57-8 
differential of function, 17 
distance from point, 103-5 
distance function, 215 
distributional solution of field equations, 
286 
domain of dependence, see 
development, 201 
dominant energy condition, 91, 92, 94, 
BRT, PAARL By73 


Cauchy 


edge of achronal set, 202 
Einstein’s field equations, 74, 75, 77, 95, 
227-55 
constraint equations, 232 
distributional solutions, 286 
exact solutions, 117—79 
existence and uniqueness of solutions, 
248, 251, 255 
initial data, 231-3 
reduced equations, 230 
stability of solutions, 253, 255 
Einstein static universe, 139 
spaces conformal to part of, 121, 126, 
131, 139 
Einstein—de Sitter universe, 138 
electromagnetic field, 68 a 
energy conditions 
weak energy condition, 89 
dominant energy condition, 91 
null convergence condition, 95 
timelike convergence condition, 95 
strong energy condition, 95 
energy extraction from black holes, 
327-8, 332-3 
energy-momentum tensor of matter 
fields, 61, 66-71, 88-96, 255 
equation of state of cold matter, 
303-7 c 
ergosphere, 327-31 
Euler-Lagrange equations, 65 
event horizon, 729, 140, 165 
in asymptotically flat spaces, 3/2, 
315-20, 324-47 
existence of solutions 
Einstein equations with matter, 255 
empty space Einstein equations, 248, 
251 
second order linear equations, 243 


387 


exp, exponential map, 33, 103, 119 
generalized, 292 
expansion 
of null geodesics, 88, 101, 312, 319, 321, 
324, 354 
of timelike curves, 82-4, 97, 271, 356 
of universe, 137, 273, 348-59 
extension 
of development, 228, 249 
of manifold, 58: locally inextendible, 59 
of space-time, 145, 150-5, 156-9, 
163-4, 171, 175: imextendible, 58, 
141; inequivalent extensions, 171-2 
exterior derivative, 25, 35 


Fermi derivative, 80-1 
fibre bundles, see bundles 
field equations 
for matter fields, 65 
for metric tensor, 71—7 
for Wey] tensor, 85 
fluid, 69; see also perfect fluid 
focal points, see conjugate points 
forms 
one-forms, 16, 44—5 
g-forms, 21, 47-9 
Friedmann equation, 138 
Friedmann space—times, 135 
function, 74 
fundamental forms of surfaces 
first, 44, 99, 231 
second, 46, 99, 100, 102, 110, 232, 262, 
273, 274 
future 
causal, J+, 183 
chronological, J+, 182 
future asymptotically predictable, 310 
future Cauchy development, Dt, 201 
horizon, H+, 202 
future directed non-spacelike curve, 184 
inextendible, 184, 194, 268 
future distinguishing condition, 
195 
future event horizon, 129, 312 
future horismos, H+, 184 
future set, 186, 187 
future trapped set, 267, 268 


192, 


g-completeness, 257, 258 

gauge conditions, 230, 247 

Gauss’ equation, 47, 336, 352 

Gauss’ theorem, 49-50 

General Relativity, 56-77, 363 
postulates, (a), 60, (b), 61, (c), 77 
breakdown of, 362-3 

generalized affine parameter, 259, 278, 

291 
generic condition, 101, 192, 194, 266 


388 


geodesics, 33, 55, 63, 217, 284-5 
as extremum, 107, 108, 213 
see also null geodesics and timelike 
geodesics 
geodesically complete, 33, 257 
examples, 119, 126, 133, 170 
geodesically incomplete, 258, 287-9 
examples, 141-2, 155, 159, 163, 176, 
190 
see also singularities 
globally hyperbolic, 206-12, 213, 215, 
223 
Godel’s universe, 168-70 
gravitational radiation from black holes, 
313, 329, 333 


harmonic gauge condition, 230, 247 
Hausdorff spaces, 13, 56, 221, 283 
non-Hausdorff b-boundary, 283, 289- 
92 
non-Hausdorff spaces, 13, 173, 177 
homogeneity 
homogeneous space-time, 168 
spatial homogeneity, 134, 142-9, 371 
horismos, E+, 184 
horizons 
apparent horizon, 320-3, 324 
event horizon, 129, 312, 315, 319, 
324-33 
particle horizon, 128 
horizontal subspace (in bundle), 53-5, 
277-82 
lift, 54, 277 
Hoyle and Narlikar’s C-field, 90, 126 
Hubble constant, 137, 355 
Hubble radius, 351 


IF, indecomposable future set, 218 
imbedding, 23, 44, 228 

induced maps of tensors, 45 
immersion, 23 


imprisoned curves, 194-6, 261, 289- 
98 
inequalities for energy-momentum ten- 
sor, 89-96 
and second order differential equations, 
237, 240, 241 


inextendible curve, 184, 218, 280 

inextendible manifold, 58, 59, 141-2 

infinity, see conformal structure of 
infinity 

initial data, 233, 252, 254 

injective map, 23 

int, interior of set, 209 

integral curves of vector field, 27 

integration of forms, 26, 49 

intersection of geodesics, see conjugate 
points 


INDEX 


IP, indecomposible past set, 218 
isometry, 43, 56, 135-6, 142, 164, 168, 
174, 323, 326, 329, 330, 334, 340-6, 
369-70 
isotropy of observations, 134-5, 349, 358 
and universe, 351, 354 
Israel’s theorem, 326 


Jacobi equation, 80, 96 
Jacobi field, 96, 97, 99, 100 


Kerr solution, 161-8, 225, 301, 310, 327, 
332 
as final state of black hole, 325-33 
global uniqueness, 331 
Killing vector field, 43, 62, 164, 167, 300, 
323, 325, 327, 330, 339 
bivector, 167, 330, 331 
Kruskal extension of Schwarzschild 
solution, 153-5 


Lagrangian, 64-7 
for matter fields, 67-70 
for Einstein’s equations, 75 
Laplace, 2, 364, 365-8 
length of curve, 37 
generalized, 259, 280 
non-spacelike curve, 105, 213, 214, 215: 
longest curve, 5, 105, 107-8, 120, 213 
Lie derivative, 27-30, 34-5, 43, 79, 87, 
341-6 
light cone, see null cone 
limit of non-spacelike curves, 184—5 
limiting mass of star, 304™7 
Lipschitz condition, 11 
local Cauchy development theorem, 248 
local causality assumption, 60 
local causality neighbourhoods, 194 
local conservation of energy and momen- 
tum, 61 
local coordinate neighbourhood, 12 
locally inextendible manifold, 59 
Lorentz metric, 38, 39, 44, 56, 190, 
252 
Lorentz group, 52, 62, 173, 277-80 
Lorentz transformation, 279, 290-1 


m-completeness, 257, 278 
manifold, 17, 14 

as space-time model, 56, 57, 363 
map of manifold, 22, 23 

induced tensor maps, 22-4 
marginally outer trapped surface, 321 
matter equations, 59-71, 88-96, 117 

254 

maximal development, 251-252 
maximal timelike curve, 110-12 
Maxwell’s equations, 68, 85, 156, 179 


INDEX 


metric tensor, 36-44, 61, 63-4 
covariant derivative, 40, 41 
Lorentz, 38, 39, 44, 56, 57, 190, 237 
on hypersurface, 44-6, 231 
positive definite, 38, 45, 126, 257, 259, 
278, 282, 283 
space of metrics, 198, 252 
microwave background radiation, 139, 
348-50, 354, 356 
isotropy, 348-53, 358 
Minkowski space-time, 118-24, 205, 218, 
222, 274, 275, 310 
Misner’s two-dimensional 
171-4 


space-time, 


naked singularities, 311 
Newman-—Penrose formalism, 344 
Newtonian gravitational theory, 71-4, 
76, 80, 201, 303-5 
non-spacelike curve, 60, 112, 184, 185, 
207 
geodesic, 105, 213 
Nordstré6m theory, 76 
normal coordinates, 34, 41, 63 
normal neighbourhood, 34, 280; see also 
convex normal neighbourhood 
null vector, 38, 57 
cone, 38, 42, 60, 103-85, 
reconverging, 266, 354 
convergence condition, 95, 192, 263, 
265, 311, 318, 320 
geodesics, 86-8, 103, 105, 116, 133, 
171, 184, 188, 203, 204, 258, 312, 
319, 354: reconverging, 267, 271, 
354, 355; closed null geodesics, 
190-1, 290 ie 
hypersurface, 45 


184, 198: 


4 


optical depth, 355, 357, 359 
orientable manifold, 73 

time orientable, 181, 182 

space orientable, 181, 182 
orientation 

of boundary, 27 

of hypersurface, 44 
orthogonal group O(p, q), 52, 277-83 
orthogonal vectors, 36 
orthonormal basis, 38, 52, 54, 80-2, 

276-83, 291 

pseudo-orthonormal basis, 86-7, 344 

outer trapped surface, 319, 320 


pancake singularity, 144 
paracompact manifold, 14, 34, 38, 57 
parallel transport, 32, 40, 277 
non-integrability, 35, 36 
p-p. singularity, 260, 290, 291 
parallelizable manifold, 52, 182 


389 


partially imprisoned non-spacelike curve, 
194, 289-92 
partial Cauchy surface, 204, 217, 265, 
274, 295, 301 
and black holes, 310-24 
particle horizon, 128, 140, 144 
past, dual of future, 183; thus past set 
4s dual of future set, 186 
PIPs, PIFs, 218 
Penrose collapse theorem, 262 
Penrose diagram, 123 
perfect fluid, 69-70, 79, 84, 136, 143, 168, 
305, 372 
plane-wave solutions, 178, 188, 206, 260 
postulates for special and _ general 
relativity 
space-time model, 56 
local causality, 60 
conservation of energy and momentum, 
61 
metric tensor, 71, 77 
p-p. curvature singularity, 260, 289-92 
prediction in General Relativity, 205-6 
product bundle, 50 
propagation equations 
expansion, 84, 88 
shear, 85, 88 
vorticity, 83, 88 
properly discontinuous group, 173 
pseudo-orthonormal basis, 86-7, 
114, 271, 290, 344 


102, 


rank of map, 23 

Raychaudhuri equation, 84, 97, 136, 275, 
286, 352 

redshift, 129, 139, 144, 161, 309, 355, 358 

regular predictable space, 318, 323 

Reissner—Nordstr6m solution, 156-61, 
188, 206, 225, 310, 360-1 

global uniqueness, 326 

Ricci tensor, 36, 41, 72—5, 85, 88, 95, 290, 
352 

Riemann tensor, 35, 36, 41, 85, 290, 352 

Robertson—Walker spaces, 134-42, 276, 
352-7 


scalar field, 67, 68, 95; see also Brans— 
Dicke 
scalar polynomial curvature singularities, 
141-2, 146, 151, 260, 289 
Schwarzschild solution, 149-56, 225, 262, 
310, 316, 326 
local uniqueness, 371 
global uniqueness, 326 
outside star, 299, 306, 308-9, 316, 360 
Schwarzschild radius, 299, 300, 307-8, 353 
mass, 306, 309 
length, 353, 358 


390 


second fundamental form of hypersur- 
surface, 46, 47 
of 3-surface, 99, 273, 274 
of 2-surface, 102, 262 
second order hyperbolic equation, 233-43 
second variation, 108, 110, 114, 296 
semispacelike set, see achronal 
186 
separation of timelike curves, 79, 96, 99 
of null geodesics, 86-7, 102 
shear tensor, 82, 85, 88, 97, 324, 351 
singularity, 3, 256-61, 360-4 
s.p. singularity, 260, 289 
p.p. singularity, 260, 290-2 
examples, 137-42, 144-6, 150-1, 159, 
162, 171-4, 177 
theorems, 7, 147, 263, 266, 271, 272, 
274, 285, 288, 292 
description, 276-84 
nature, 284-9, 360-1, 363 
in collapsing stars, 308, 310, 311, 360-1 
in universe, 355, 358-9 
singularity-free space—times, 258, 260 
examples, 119, 126, 133, 139, 170, 
305-6 
skew symmetry, 20-1 
Sobolev spaces, 234 
S.p. curvature singularity, 141-2, 146, 
151, 260, 289 
spacelike hypersurface, 45 
spacelike three-surface, 99, 170, 201, 204, 
313 
spacelike two-surface, 101, 262 
spacelike vector, 38, 57 
space-orientable, 78] 
space-time manifold, 4, 14, 56, 57 
breakdown, 363 
connection, 41, 59, 63 
differentiability, 57, 58, 284—7 
inextendible, 58 
metric, 56, 60, 227 
non-compact, 190 
space and time orientable, 181-2 
topology, 197 
spatially homogeneous, 
sil 
Special Relativity, 60, 62, 71, 118 
speed of light, 60, 61, 94 
spinors, 52, 59, 182 
spherically symmetric solutions, 
149-61, 299, 305-6, 369-72 
stable causality, 198 
stability 
of Einstein’s equations, 253, 255, 301 
of singularity, 273, 360 
star, 299-308 
white dwarfs, neutron stars, 304, 307 
life history, 301, 307-8 


set, 


134, 142-9, 


135, 


INDEX 


static space—times, 72, 73 
spherically symmetric, 149-61, 305-6, 
371 
regular predictable space-times, 325-9 
stationary axisymmetric solutions, 161— 
70 
stationary regular predictable space- 
times, 323-47 
stationary limit surface, 165-167, 328, 331 
steady-state universe, 90, 126 
Stokes’ theorem, 27 
strong causality condition, 192, 194, 195, 
208, 209, 217, 222, 261, 267, 271 
strong energy condition, 95 
strongly future asymptotically predict- 
able, 313, 317, 318 
summation convention, 15 
symmetric and skew-symmetric tensors, 
20-1 
symmetries of space-time, 44 
axial symmetry, 329 
homogeneity, 168 
spatial homogeneity, 135, 142 
spherical symmetry, 369 
static spaces, 72, 325 
stationary spaces, 323 
time-symmetry, 326 


tangent bundle, 51, 53—4, 292, 351 
tangent vector space, 16, 51 
dual space, 17 
Taub-NUT space, 170-8, 206, 261, 289- 
92 


tensor of type (r,s), 17 ™ 

field of type (7, 6), 27 

bundle of tensors of type (r,s), 51 
tensor product, 78 


theorems 
conservation theorem, 94 
singularities in homogeneous cos- 


mologies, 147 
local Cauchy development, 248 
global Cauchy development, 251 
Cauchy stability theorem, 253 
singularity theorems: theorem 1, 263; 
theorem 2, 266; theorem 3, 271; 
theorem 4, 272; theorem 5, 292; 
weakened conditions, 285, 288 
tidal force, 80 
TIFs, TIPs, 278 
time coordinates, 170, 198 
time orientable, 131, 187, 182 
time symmetric, 326, 328 
black hole, 330 
timelike convergence conditions, 95, 265, 
266, 271, 272, 285, 363 
timelike curves, 69, 79-85, 103, 184, 
213-15, 218 


INDEX 


timelike geodesics, 
111-12, 
288 

timelike hypersurface, 44 

timelike singularity, 159, 360-1 

timelike vector, 38, 57 

topology of manifold, 12-14 

Alexandrov topology, 196, 197 

topology of set of Lorentz metrics, 198, 
252 

topology of space of curves, 208, 214 

torsion tensor, 34, 41 

totally imprisoned curves, 194, 195, 289— 
98 

trapped region, 319-20 

trapped set, 267 

trapped surface, see closed trapped sur- 
face 


63, 96-100, 103, 
133, 159, 170, 217, 258, 


uniqueness of solutions 
of Einstein’s equations: locally, 246, 
255; globally, 251, 255 
of second order linear equations, 239, 
243 
universe, 3, 348—59, 360, 362, 364 


391 


spatially homogeneous universe models 
anisotropic, 142-9; isotropic, 134— 
42, 351-3, 356-7 


vacuum solutions of field equations, 118, 
150, 161, 170, 178, 244-54 
variation 
of fields in Lagrangian, 65 
of timelike curve, 106-10, 295 
of non-spacelike curves, 112-16, 191 
vector, 15, 16, 38, 57 
field, 21, 27, 51, 52, 54, 55, 277, 278 
variation vector, 107—16, 191, 275, 295 
see also Killing vector 
vertical subspaces in bundles, 53, 277 
volume, 48, 49 
vorticity 
of Jacobi fields, 97 
of null geodesics, 88 
of timelike curves, 82—4, 352 


weak energy condition, 89, 94 

weakly asymptotically simple and empty 
spaces, 225, 310 

Weyl tensor, 47, 42, 85, 88, 101, 224, 344 
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